Variants with altered properties
FIELD OF THE INVENTION

The present invention relates to variants (mutants) of parent
Termamyl-like alpha-amylases, which variants have alpha-amylase
activity and exhibit an alteration in at least one of the
following properties relative to said parent alpha-amylase:
substrate specificity, substrate binding, substrate cleavage
pattern, thermal stability, pH/activity profile, pH/stability
profile, stability towards oxidation, Ca2+ dependency, specific
activity, and solubility, in particular under production and
application conditions.

BACKGROUND OF THE INVENTION 

Alpha-amylases (alpha-1,4-glucan-4-glucanohydrolases, E.C.
3.2.1.1) constitute a group of enzymes which catalyze
hydrolysis of starch and other linear and branched
1,4-glucosidic oligo- and polysaccharides.



concentrations are achieved during fermentation, purification,
recovery and in product formulation.

During the fermentation the protein concentration depends on
the host cell used. In industrial processes the protein
concentration typically lies above 0.1 g/liter fermentation
broth. For high-yield recombinant production of alpha-amylases
in Bacillus sp. the protein concentration may be as high as
250 g/liter fermentation broth. After purification the protein
concentration may reach levels of about 1000 g/liter.
Such high concentrations lead to undesirable precipitation,
resulting in loss of active protein. The tendency today is
towards products of increasing strength, which makes the ability
to maintain the enzyme in solution of increasing importance.
Often up-concentration of a protein solution results in protein
precipitates, which are difficult to dissolve into active
protein.

Thus, it is the object of this application to provide
alpha-amylases with altered properties as defined below, in
particular increased solubility.

BRIEF DISCLOSURE OF THE INVENTION

The object of the present invention is to provide variants of
Termamyl-like amylases which, in comparison to the corresponding
parent alpha-amylase, i.e., the un-mutated alpha-amylase, have
alpha-amylase activity and exhibit an alteration in at least one
of the following properties relative to said parent alpha-amylase:
substrate specificity, substrate binding, substrate cleavage
pattern, thermal stability, pH/activity profile, pH/stability
profile, stability towards oxidation, Ca2+ dependency, specific
activity, and solubility, in particular increased solubility
under production conditions.




Nomenclature



In the present description and claims, the conventional
one-letter and three-letter codes for amino acid residues are
used. For ease of reference, alpha-amylase variants of the
invention are described by use of the following nomenclature:
Original amino acid(s): position(s): substituted amino acid(s)

According to this nomenclature, for instance the replacement
of alanine with asparagine in position 30 is shown as:
Ala30Asn or A30N
a deletion of alanine in the same position is shown as:
Ala30* or A30*
and insertion of an additional amino acid residue, such as
lysine, is shown as:
Ala30AlaLys or A30AK

A deletion of a consecutive stretch of amino acid residues,
such as amino acid residues 30-33, is indicated as (30-33)* or
Δ(A30-N33).

Where a specific alpha-amylase contains a "deletion" in
comparison with other alpha-amylases and an insertion is made in
such a position, this is indicated as:
*36Asp or *36D
for insertion of an aspartic acid in position 36.
Multiple mutations are separated by plus signs, i.e.:
Ala30Asn + Glu34Ser or A30N+E34S
representing mutations in positions 30 and 34 replacing
alanine and glutamic acid with asparagine and serine,
respectively.

When one or more alternative amino acid residues may be
inserted in a given position it is indicated as
A30N,E or
A30N or A30E

Furthermore, when a position suitable for modification is
identified herein without any specific modification being
suggested, it is to be understood that any amino acid residue
may be substituted for the amino acid residue present in the
position. Thus, for instance, when a modification of an alanine
in position 30 is mentioned, but not specified, it is to be
understood that the alanine may be deleted or substituted for
any other amino acid, i.e., any one of:
R,N,D,A,C,Q,E,G,H,I,L,K,M,F,P,S,T,W,Y,V.
Further, "A30X" means any one of the following substitutions:
A30R, A30N, A30D, A30C, A30Q, A30E, A30G, A30H, A30I, A30L,
A30K, A30M, A30F, A30P, A30S, A30T, A30W, A30Y, or A30V; or in
short: A30R,N,D,C,Q,E,G,H,I,L,K,M,F,P,S,T,W,Y,V.
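The one-letter form of this nomenclature is regular enough to be read mechanically. The following Python sketch is an illustration only and is not part of the disclosed method; the function names `parse_mutation` and `parse_variant` and the dictionary layout they return are assumptions made for this example. It covers substitutions (A30N), alternatives (A30N,E), the wildcard (A30X), deletions (A30*), insertions after a residue (A30AK), insertions into a gap (*36D) and plus-separated combinations. Three-letter codes and range deletions are not handled.

```python
import re

# The 20 standard amino acids in one-letter code.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Original residue (or * for a gap), position, then the replacement part.
_PATTERN = re.compile(r"^([A-Z*])(\d+)([A-Z*,]+)$")


def parse_mutation(text):
    """Parse one mutation such as 'A30N', 'A30*', 'A30AK', '*36D' or 'A30N,E'."""
    match = _PATTERN.match(text.replace(" ", ""))
    if match is None:
        raise ValueError(f"unrecognised mutation: {text!r}")
    original, position, rest = match.group(1), int(match.group(2)), match.group(3)

    if rest == "*":
        return {"kind": "deletion", "original": original, "position": position}
    if original == "*":
        # Insertion at a position where this sequence has a gap.
        return {"kind": "insertion", "original": None,
                "position": position, "inserted": rest}
    if len(rest) > 1 and "," not in rest and rest[0] == original:
        # A30AK: the original residue is kept and K is inserted after it.
        return {"kind": "insertion", "original": original,
                "position": position, "inserted": rest[1:]}
    if rest == "X":
        # A30X: any amino acid other than the original one.
        alternatives = [aa for aa in AMINO_ACIDS if aa != original]
    else:
        alternatives = rest.split(",")
        if any(len(aa) != 1 or aa not in AMINO_ACIDS for aa in alternatives):
            raise ValueError(f"unrecognised mutation: {text!r}")
    return {"kind": "substitution", "original": original,
            "position": position, "alternatives": alternatives}


def parse_variant(text):
    """Parse a plus-separated combination such as 'A30N+E34S'."""
    return [parse_mutation(part) for part in text.split("+")]
```

For example, `parse_variant("A30N+E34S")` yields two substitution records, for positions 30 and 34.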

BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1 is an alignment of the amino acid sequences of five
parent Termamyl-like alpha-amylases. The numbers on the extreme
left designate the respective amino acid sequences as follows:
1: SEQ ID NO: 4 (SP722)
2: SEQ ID NO: 2 (SP690)
3: SEQ ID NO: 10 (BAN)
4: SEQ ID NO: 8 (BLA)
5: SEQ ID NO: 6 (BSG).

Figure 2 shows a unit cell of 4 enzyme molecules. Each molecule
has 8 interacting zones.

DETAILED DISCLOSURE OF THE INVENTION 

The object of the present invention is to provide
polypeptides, such as enzymes, in particular alpha-amylases,
with an alteration in at least one of the following properties
relative to said parent polypeptide: substrate specificity,
substrate binding, substrate cleavage pattern, thermal
stability, pH/activity profile, pH/stability profile, stability
towards oxidation, Ca2+ dependency, specific activity, and
solubility, in particular under production conditions. The
properties will be defined further below.

Polypeptide 

Polypeptides according to the invention include proteins with 
biological activity, antimicrobial activity, and enzymatic 
activity. 

Contemplated enzyme activities include... 

In one preferred embodiment the enzyme is an alpha-amylase, in
particular a Bacillus or Aspergillus alpha-amylase. In a
preferred embodiment the Bacillus alpha-amylase is a
Termamyl-like amylase.

Polypeptide with Altered Solubility 

A protein (polypeptide) crystal is built up of systematically
ordered identical units closely packed together in 3 dimensions.
The unit cells may contain one or more protein molecules and a
significant amount of water. Also ions like Calcium, Sodium,
Chloride, Sulphate and larger molecules such as surfactants and
substrates - present during crystal growth - can be identified
in the tertiary structure. Although the single molecules and the
single atoms are very small, the repeating array of identical
units facilitates X-ray scattering and thus provides structure
information, which can be used for protein engineering purposes.

Relative to crystals, precipitates and aggregates are smaller
units and the single molecules are more disordered than in the
crystals. Nevertheless some of the intermolecular interactions
are the same as those found in well-ordered crystals, and thus
the tertiary structure (3D structure) and intermolecular
information can be used for designing protein engineered
alpha-amylase variants with altered properties, in particular
with reduced tendency to precipitate, i.e., variants with
increased solubility.

Due to the spherical and often irregular surface structures
of proteins, large holes in the unit cell exist, which are
occupied by more disordered solvents like water. In fact, the
large majority of the protein surface is covered with layers of
water, which is one reason why structures of proteins solved by
X-ray crystallography are the same as those for proteins in
solution. Only in a few areas are the protein molecules in
direct contact with each other, but solvent mediated contacts
also function as the "glue" that binds the crystal together.

In general the solubility of proteins is affected by organic
solvents, salts like ammonium sulphate, NaCl and CaCl2, and by pH
change altering the surface charge of the molecule. These
factors are considered both for enzyme production, to keep the
enzyme in solution, and for crystallography experiments, to grow
useful crystals. Large symmetric crystals are grown by slow
increase in protein concentration or by altering the protein
surface, thereby enhancing the inter-molecular contacts.

Not all proteins crystallise in a form useful for X-ray
crystallography determination methods, but based on an existing
tertiary structure accurate models can be built if the homology
between the template molecule and the molecule of interest is
high enough.

When homologous polypeptides, such as enzymes, in particular
Termamyl-like alpha-amylases (defined below), are compared on a
tertiary structural basis, the large majority of differences are
found on the surface of the molecule.

Despite that, the inventors found that surface residues
identified to be involved - directly or indirectly through water
molecules - in crystal formation for the Termamyl-like amylase
(SP722 disclosed in SEQ ID NO: 4) also play a key role in the
crystallisation of other Termamyl-like amylases (defined below).
Below is described one example of modelling a tertiary
structure of one Termamyl-like alpha-amylase from another
Termamyl-like alpha-amylase structure (SP722 structure disclosed
in APPENDIX 1).

It is to be understood that the concept of the invention and
the below described modelling method according to the invention
can generally be extrapolated to all polypeptides, proteins, in
particular enzymes, such as alpha-amylases.

The Tertiary Structure of SP722 and Modelling the Tertiary
Structures of Another Termamyl-like alpha-amylase.

Mutants of alpha-amylases of the present invention have been
found based on the tertiary structure shown in APPENDIX 1 of
SP722 (SEQ ID NO: 4). Mutants of other polypeptides may be found
based on other tertiary structures. Crystals of the alkaline
Termamyl-like alpha-amylase (SP722) (shown in SEQ ID NO: 4 and
also disclosed in US patent no. 5,824,531) were obtained by the
hanging drop method (well-known in the art), and the tertiary
structure (3D-structure) is disclosed in APPENDIX 1 herein.

The unit cell was found to contain 4 enzyme molecules and
each molecule has 8 interacting zones (see Fig. 2). Two zones
have been identified on the side of the enzyme surrounding the
active site and one large area is found on the backside of the
alpha-amylase relative to the active site. There is further an
interaction site at the bottom and one at the top of the
molecule making top-to-bottom interactions.

As can be seen from Fig. 2 the two interaction zones
surrounding the active site interact with the same two
areas on an anti-parallel neighbour molecule. Likewise the
backside zone is in contact with the backside zone on a third
anti-parallel amylase molecule, although all contacts here are
water mediated. It can also be seen from Fig. 2 that the
interacting areas are spread all over the molecule.

A model of another alkaline Termamyl-like amylase, AA560, has
been built based on the SP722 tertiary structure disclosed in
APPENDIX 1. The AA560 alpha-amylase is about 87% identical to
the template amylase (SP722) and the alignment contains no
insertions or deletions. Due to the high homology (identity), the
same symmetry and the same crystal interactions, the same
interaction zones on the protein surface are involved in
crystallization and precipitation at the increasing protein
concentrations which are reached during production, i.e.,
starting from the fermentation stage through the purification
stage (see Background section).

Mutations in these interaction zones were constructed, the
enzyme expressed and purified, and the protein solubility was
measured under different conditions as described in Examples 8
and 9 using the method described in the "Materials and Methods"
section.

The findings of the present invention may be applied to
Termamyl-like amylases being at least 60% identical, preferably
at least 70% identical, more preferably 80% identical, even more
preferably 85% identical, even more preferably 90% identical,
even more preferably 95% identical, even more preferably 97%
identical, even more preferably 99% identical to the
Termamyl-like alpha-amylase shown in SEQ ID NO: 12. Preferably
the findings may be used on alkaline Termamyl-like
alpha-amylases, especially alkaline alpha-amylases of the same
length, without additional amino acid residues or gaps in an
aligned primary structure in comparison to SP722 (SEQ ID NO: 4,
shown as number 1 in the alignment in Fig. 1). Especially, the
findings may be used on the following alkaline Termamyl-like
alpha-amylases: SP690 (SEQ ID NO: 2), SP722 (SEQ ID NO: 4),
AA560 (SEQ ID NO: 12), #707 alpha-amylase (SEQ ID NO: 13), the
KSM AP1378 alpha-amylase disclosed in WO 97/00324, or fragments
or truncated forms thereof. The latter mentioned alkaline
alpha-amylases have very similar tertiary crystal structure
around the above-mentioned interaction zones, and have the same
primary structure length of 485 amino acids.

Contrary hereto, for instance, Termamyl (shown as sequence
number 4 in the alignment in Fig. 1) lacks two amino acid
residues (positions 1 and 2); has gaps in positions 174 and
181-182; and has three additional amino acid residues in
positions 378-381 when aligned with SP722.

BAN (shown as sequence number 3 in the alignment in Fig. 1)
lacks five amino acid residues (positions 1-4 and 488); has gaps
in positions 174 and 181-182; and has three additional amino
acid residues in positions 378-381 if aligned with SP722.

BSG (shown as sequence number 5 in the alignment in Fig. 1)
lacks one amino acid residue (position 1); and has 31
additional amino acid residues in positions 489-519 if aligned
with SP722.

KSM-K36 and KSM-K38 (EP 1,022,334-A) lack five amino acid
residues (positions 1 and 2) and have gaps in positions 174 and
181-182 when aligned with SP722.

AA180, AA20 and Amrk385 (Danish patent application no. PA 2000
00347) have one additional amino acid in position 261 when
aligned with SP722.

Below it is described how to model a Termamyl-like
alpha-amylase from another alpha-amylase. This method can be
extrapolated to other polypeptides.

Modelling of Termamyl-like Alpha-amylases 

WO 96/23874 provides the tertiary structure (3D Structure),
X-ray crystal structural data for a Termamyl-like alpha-amylase,
which consists of the 300 N-terminal amino acid residues of the
B. amyloliquefaciens alpha-amylase (BAN™) and amino acids
301-483 of the C-terminal end of the B. licheniformis
alpha-amylase (SEQ ID NO: 8). WO 96/23874 further describes
methodology for designing (modelling), on the basis of an
analysis of the structure of a parent Termamyl-like
alpha-amylase, variants of the parent Termamyl-like alpha-amylase
which exhibit altered properties relative to the parent.

Other Termamyl-like structures may be modelled in accordance
with WO 96/23874, which is hereby incorporated by reference.

In connection with obtaining variants of the present
invention the AA560 tertiary structure was designed (modelled)
based on the tertiary structure of SP722 (disclosed in APPENDIX
1) as described in Example 1. The structure of other
Termamyl-like alpha-amylases (e.g., those disclosed herein) may
be built analogously.

Termamyl-like alpha-amylases 

A number of alpha-amylases produced by Bacillus spp. are
highly homologous (identical) on the amino acid level.
The identity of a number of Bacillus alpha-amylases can be found
in Table 1 below:

Table 1

Percent identity

            707     AP1378  BAN     BSG     SP690   SP722   AA560   Termamyl
707         100.0   86.4    66.9    66.5    87.6    86.2    95.5    68.1
AP1378      86.4    100.0   67.1    68.1    95.1    86.6    86.0    69.4
BAN         66.9    67.1    100.0   65.6    67.1    68.8    66.9    80.7
BSG         66.5    68.1    65.6    100.0   67.9    67.1    66.3    65.4
SP690       87.6    95.1    67.1    67.9    100.0   87.2    87.0    69.2
SP722       86.2    86.6    68.8    67.1    87.2    100.0   86.8    70.8
AA560       95.5    86.0    66.9    66.3    87.0    86.8    100.0   68.3
Termamyl    68.1    69.4    80.7    65.4    69.2    70.8    68.3    100.0



For instance, the B. licheniformis alpha-amylase comprising
the amino acid sequence shown in SEQ ID NO: 8 (commercially
available as Termamyl™) has been found to be about 81%
homologous with the B. amyloliquefaciens alpha-amylase
comprising the amino acid sequence shown in SEQ ID NO: 10 and
about 65% homologous with the B. stearothermophilus
alpha-amylase comprising the amino acid sequence shown in SEQ ID
NO: 6. Further homologous alpha-amylases include SP690 and SP722
disclosed in WO 95/26397 and further depicted in SEQ ID NO: 2
and SEQ ID NO: 4, respectively, herein. Other amylases are the
AA560 alpha-amylase derived from Bacillus sp. and shown in SEQ
ID NO: 12, and the #707 alpha-amylase derived from Bacillus sp.,
shown in SEQ ID NO: 13 and described by Tsukamoto et al.,
Biochemical and Biophysical Research Communications, 151 (1988),
pp. 25-31.

The KSM AP1378 alpha-amylase is disclosed in WO 97/00324 (from
KAO Corporation). Also the K36 and K38 alpha-amylases disclosed
in EP 1,022,334 are contemplated according to the invention.

Still further homologous alpha-amylases include the
alpha-amylase produced by the B. licheniformis strain described
in EP 0252666 (ATCC 27811), and the alpha-amylases identified in
WO 91/00353 and WO 94/18314. Other commercial Termamyl-like
alpha-amylases are comprised in the products sold under the
following tradenames: Optitherm™ and Takatherm™ (available from
Solvay); Maxamyl™ (available from Gist-brocades/Genencor),
Spezyme AA™ and Spezyme Delta AA™ (available from Genencor), and
Keistase™ (available from Daiwa), Purastar™ ST 5000E, PURASTRA™
HPAM L (from Genencor Int.).

Because of the substantial homology found between these
alpha-amylases, they are considered to belong to the same class
of alpha-amylases, namely the class of "Termamyl-like
alpha-amylases".

Accordingly, in the present context, the term "Termamyl-like
alpha-amylase" is intended to indicate an alpha-amylase which,
at the amino acid level, exhibits a substantial identity to
Termamyl™, i.e., the B. licheniformis alpha-amylase having the
amino acid sequence shown in SEQ ID NO: 8 herein.

In other words, all the following alpha-amylases, which have
the amino acid sequences shown in SEQ ID NOS: 2, 4, 6, 8, 10, 12
and 13 herein, are considered to be "Termamyl-like
alpha-amylases". Other Termamyl-like alpha-amylases are
alpha-amylases i) which display at least 60%, such as at least
70%, e.g., at least 75%, or at least 80%, at least 85%, at least
90%, at least 95%, at least 97%, at least 99% homology with at
least one of said amino acid sequences shown in SEQ ID NOS: 2, 4,
6, 8, 10, 12, and 13, and/or ii) which are encoded by a DNA
sequence which hybridizes to the DNA sequences encoding the
above-specified alpha-amylases which are apparent from SEQ ID
NOS: 1, 3, 5, 7, 9 and 11 of the present specification (which
encoding sequences encode the amino acid sequences shown in SEQ
ID NOS: 2, 4, 6, 8, 10 and 12 herein, respectively).

In connection with property i), the homology may be
determined as the degree of identity between the two sequences
indicating a derivation of the first sequence from the second.
The homology may suitably be determined by means of computer
programs known in the art such as GAP provided in the GCG
program package (described above). Thus, GAP GCGv8 may be used
with the following default parameters: GAP creation penalty of
5.0 and GAP extension penalty of 0.3, default scoring matrix,
for nucleic acid sequences and 3.0 and 0.1, respectively, for
protein sequences. GAP uses the method of
Needleman/Wunsch/Sellers to make alignments.
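To make the notion of "degree of identity" concrete, the following Python sketch performs a global alignment in the Needleman/Wunsch style and then computes a percent identity. It is a simplified illustration and not the GCG GAP program: it assumes a plain match/mismatch score and a single linear gap penalty instead of GAP's scoring matrix and separate creation and extension penalties, and it assumes that identity is counted over all alignment columns. The function names and default scores are hypothetical.

```python
def needleman_wunsch(a, b, match=1.0, mismatch=-1.0, gap=-2.0):
    """Return one optimal global alignment of a and b as two gapped strings."""
    def sub(x, y):
        return match if x == y else mismatch

    rows, cols = len(a) + 1, len(b) + 1
    score = [[0.0] * cols for _ in range(rows)]
    for i in range(1, rows):
        score[i][0] = score[i - 1][0] + gap
    for j in range(1, cols):
        score[0][j] = score[0][j - 1] + gap
    for i in range(1, rows):
        for j in range(1, cols):
            score[i][j] = max(
                score[i - 1][j - 1] + sub(a[i - 1], b[j - 1]),  # align two residues
                score[i - 1][j] + gap,                          # gap in b
                score[i][j - 1] + gap,                          # gap in a
            )

    # Trace back from the bottom-right corner to recover the alignment.
    out_a, out_b = [], []
    i, j = len(a), len(b)
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub(a[i - 1], b[j - 1]):
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif i > 0 and (j == 0 or score[i][j] == score[i - 1][j] + gap):
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(aligned_a, aligned_b):
    """Percentage of alignment columns holding the same residue in both rows."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        return 0.0
    identical = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * identical / len(aligned_a)
```

Other conventions divide by the length of the shorter sequence or by the number of ungapped columns, so values from this sketch are not expected to reproduce figures obtained with GAP.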

A structural alignment between Termamyl (SEQ ID NO: 8) and a
Termamyl-like alpha-amylase may be used to identify
equivalent/corresponding positions in other Termamyl-like
alpha-amylases. One method of obtaining said structural alignment
is to use the Pile Up programme from the GCG package using default
values of gap penalties, i.e., a gap creation penalty of 3.0 and
gap extension penalty of 0.1. Other structural alignment methods
include the hydrophobic cluster analysis (Gaboriaud et al.,
(1987), FEBS LETTERS 224, pp. 149-155) and reverse threading
(Huber, T; Torda, AE, PROTEIN SCIENCE Vol. 7, No. 1 pp. 142-149
(1998)).

Hybridisation 

The oligonucleotide probe used in the characterisation
of the polypeptide, such as the Termamyl-like alpha-amylase, in
accordance with property ii) above may suitably be prepared on
the basis of the full or partial nucleotide or amino acid
sequence of the alpha-amylase in question.

Suitable conditions for testing hybridisation involve
pre-soaking in 5xSSC and prehybridizing for 1 hour at ~40°C in a
solution of 20% formamide, 5xDenhardt's solution, 50 mM sodium
phosphate, pH 6.8, and 50 mg of denatured sonicated calf thymus
DNA, followed by hybridisation in the same solution supplemented
with 100 mM ATP for 18 hours at ~40°C, followed by three times
washing of the filter in 2xSSC, 0.2% SDS at 40°C for 30 minutes
(low stringency), preferably at 50°C (medium stringency), more
preferably at 65°C (high stringency), even more preferably at
~75°C (very high stringency). More details about the
hybridisation method can be found in Sambrook et al., Molecular
Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor, 1989.

In the present context, "derived from" is intended not only
to indicate an alpha-amylase produced or producible by a strain
of the organism in question, but also an alpha-amylase encoded
by a DNA sequence isolated from such strain and produced in a
host organism transformed with said DNA sequence. Finally, the
term is intended to indicate an alpha-amylase, which is encoded
by a DNA sequence of synthetic and/or cDNA origin and which has
the identifying characteristics of the alpha-amylase in
question. The term is also intended to indicate that the parent
alpha-amylase may be a variant of a naturally occurring
alpha-amylase, i.e., a variant, which is the result of a
modification (insertion, substitution, deletion) of one or more
amino acid residues of the naturally occurring alpha-amylase.

Parent Termamyl-like Alpha-amylases

According to the invention all Termamyl-like alpha-amylases,
as defined above, may be used as the parent (i.e., backbone)
alpha-amylase. In a preferred embodiment of the invention the
parent alpha-amylase is derived from B. licheniformis, e.g., one
of those referred to above, such as the B. licheniformis
alpha-amylase having the amino acid sequence shown in SEQ ID NO: 8.
Especially preferred parent alpha-amylases are the SP722
alpha-amylase and the AA560 alpha-amylase. In one embodiment the
parent alpha-amylase has one or more of the following
mutations/substitutions: Delta (R181-G182); Delta (D183-G184);
Delta (D183-G184)+N195F; R181Q+N445Q+K446N; Delta
(D183-G184)+R181Q.

Parent hybrid Termamyl-like Alpha-amylases

The parent alpha-amylase (i.e., backbone alpha-amylase) may
also be a hybrid alpha-amylase, i.e., an alpha-amylase which
comprises a combination of partial amino acid sequences derived
from at least two alpha-amylases.

The parent hybrid alpha-amylase may be one which, on the
basis of amino acid homology (identity) and/or DNA hybridization
(as defined above), can be determined to belong to the
Termamyl-like alpha-amylase family. In this case, the hybrid
alpha-amylase is typically composed of at least one part of a
Termamyl-like alpha-amylase and part(s) of one or more other
alpha-amylases selected from Termamyl-like alpha-amylases or
non-Termamyl-like alpha-amylases of microbial (bacterial or
fungal) and/or mammalian origin.

Thus, the parent hybrid alpha-amylase may comprise a
combination of partial amino acid sequences deriving from at
least two Termamyl-like alpha-amylases, or from at least one
Termamyl-like and at least one non-Termamyl-like bacterial
alpha-amylase, or from at least one Termamyl-like and at least
one fungal alpha-amylase. The Termamyl-like alpha-amylase from
which a partial amino acid sequence derives may be any of those
specific Termamyl-like alpha-amylases referred to herein.

For instance, the parent alpha-amylase may comprise a
C-terminal part of an alpha-amylase derived from a strain of B.
licheniformis, and an N-terminal part of an alpha-amylase derived
from a strain of B. amyloliquefaciens or from a strain of B.
stearothermophilus. For instance, the parent alpha-amylase may
comprise at least 430 amino acid residues of the C-terminal part
of the B. licheniformis alpha-amylase, and may, e.g., comprise
a) an amino acid segment corresponding to the 37 N-terminal
amino acid residues of the B. amyloliquefaciens alpha-amylase
having the amino acid sequence shown in SEQ ID NO: 10 and an
amino acid segment corresponding to the 445 C-terminal amino
acid residues of the B. licheniformis alpha-amylase having the
amino acid sequence shown in SEQ ID NO: 8, or a hybrid
Termamyl-like alpha-amylase being identical to the Termamyl
sequence, i.e., the Bacillus licheniformis alpha-amylase shown in
SEQ ID NO: 8, except that the N-terminal 35 amino acid residues
(of the mature protein) have been replaced by the N-terminal 33
residues of BAN (mature protein), i.e., the Bacillus
amyloliquefaciens alpha-amylase shown in SEQ ID NO: 10; or b)
an amino acid segment corresponding to the 68 N-terminal amino
acid residues of the B. stearothermophilus alpha-amylase having
the amino acid sequence shown in SEQ ID NO: 6 and an amino acid
segment corresponding to the 415 C-terminal amino acid residues
of the B. licheniformis alpha-amylase having the amino acid
sequence shown in SEQ ID NO: 8.

Another suitable parent hybrid alpha-amylase is the one
previously described in WO 96/23874 (from Novo Nordisk)
constituting the N-terminus of BAN, Bacillus amyloliquefaciens
alpha-amylase (amino acids 1-300 of the mature protein) and the
C-terminus from Termamyl (amino acids 301-483 of the mature
protein).
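At sequence level, a hybrid of the kind described above is the N-terminal segment of one parent joined to the C-terminal segment of another. The Python sketch below illustrates only that bookkeeping; the function name `hybrid_from_termini` is hypothetical, the sequences in the usage note are toy placeholders rather than real amylase sequences, and nothing here addresses how such a hybrid would be constructed at the DNA level.

```python
def hybrid_from_termini(n_parent, n_count, c_parent, c_count):
    """Join the first n_count residues of n_parent to the last c_count residues of c_parent.

    Both sequences are mature-protein sequences in one-letter code.
    """
    if not 0 < n_count <= len(n_parent):
        raise ValueError("n_count must be between 1 and len(n_parent)")
    if not 0 < c_count <= len(c_parent):
        raise ValueError("c_count must be between 1 and len(c_parent)")
    n_segment = n_parent[:n_count]                    # residues 1..n_count
    c_segment = c_parent[len(c_parent) - c_count:]    # last c_count residues
    return n_segment + c_segment
```

With toy inputs, `hybrid_from_termini("AAAAA", 2, "BBBBBB", 3)` gives `"AABBB"`. Under this sketch, option a) above would correspond to `n_count=37` and `c_count=445`, giving a hybrid of 482 residues.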

Altered properties 

The following discusses the relationship between mutations,
which are present in variants of the invention, and desirable
alterations in properties (relative to those of a parent
Termamyl-like alpha-amylase), which may result therefrom.

As mentioned above the invention relates to Termamyl-like
alpha-amylases with altered properties, in particular under
production conditions.

Parent Termamyl-like alpha-amylases specifically contemplated
in connection with going through the specifically contemplated
altered properties are the above-mentioned parent Termamyl-like
alpha-amylases and parent hybrid Termamyl-like alpha-amylases.
The SP722 alpha-amylase is used as the starting point, but
corresponding positions in, e.g., the Termamyl, BSG, BAN, AA560,
SP690, KSM AP1378, #707, K38, and K36 alpha-amylases should be
understood as disclosed too.

In a preferred embodiment the variant of the invention has
increased solubility, in particular under washing or cleaning
conditions.

In an aspect the invention relates to variants with altered
properties as mentioned above.

In the first aspect the invention relates to a variant of a
parent Termamyl-like alpha-amylase, comprising an alteration at
one or more positions (using SEQ ID NO: 12 for the amino acid
numbering) selected from the group of:
R28, R118, N174, R181, G182, D183, G184, G186, W189, N195, M202,
Y298, N299, K302, S303, N306, R310, N314, R320, H324, E345,
Y396, R400, W439, R444, N445, K446, Q449, R458, N471, N484,
wherein

(a) the alteration(s) are independently

(i) an insertion of an amino acid downstream of the amino
acid which occupies the position,

(ii) a deletion of the amino acid which occupies the
position, or

(iii) a substitution of the amino acid which occupies the
position with a different amino acid,

(b) the variant has alpha-amylase activity and (c) each position
corresponds to a position of the amino acid sequence of the
parent Termamyl-like alpha-amylase having the amino acid
sequence of AA560 shown in SEQ ID NO: 12.

In SP722 (SEQ ID NO: 4) such corresponding positions are: R28,
N94, L118, N125, Q174, R181, G182, D183, G184, A186, W189, N195,
M202, Y298, N299, N302, S303, N306, A310, N314, K320, H324,
Q345, F396, T400, W439, Q444, N445, K446, Q449, K458, N471,
K484.

In a preferred embodiment the variant of the invention (using
SEQ ID NO: 12 for the numbering) has one or more of the
following mutations/substitutions:

Delta G184; Delta (R181-G182); Delta (D183-G184);
R28N,K; S94K; R118K; N125A,R,K; N174D; R181Q,E,K; G186R;
W189R,K; N195F; M202L; Y298H,F; N299A; K302R; S303Q;
N306G,D,R,K; R310A,K,Q,E,H,D,N; N314D; R320K; H324K;
E345R,D,K,N; Y396F; R400T,K; W439R; R444K; N445K,Q; K446N;
Q449E; R458K; N471E; N484Q.
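The shorthand in the list above packs several alternatives into one entry (for example, R28N,K stands for R28N or R28K) and writes deletions as Delta entries. As an illustration only, the Python sketch below expands such entries; the function names `expand_alternatives` and `deleted_positions` are hypothetical, and the accepted spellings are limited to the forms that appear in this list.

```python
import re


def expand_alternatives(entry):
    """Expand 'R28N,K' into ['R28N', 'R28K']; a single substitution is returned as a one-item list."""
    compact = entry.replace(" ", "")
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z](?:,[A-Z])*)", compact)
    if match is None:
        raise ValueError(f"not a substitution entry: {entry!r}")
    prefix = match.group(1) + match.group(2)
    return [prefix + residue for residue in match.group(3).split(",")]


def deleted_positions(entry):
    """Return the positions removed by 'Delta G184' or 'Delta (D183-G184)'."""
    compact = entry.replace(" ", "")
    match = re.fullmatch(r"Delta\(?([A-Z])(\d+)(?:-([A-Z])(\d+))?\)?", compact)
    if match is None:
        raise ValueError(f"not a deletion entry: {entry!r}")
    first = int(match.group(2))
    # A single-residue deletion has no second position.
    last = int(match.group(4)) if match.group(4) else first
    if last < first:
        raise ValueError(f"range runs backwards: {entry!r}")
    return list(range(first, last + 1))
```

For instance, `expand_alternatives("N306G,D,R,K")` lists the four N306 substitutions, and `deleted_positions("Delta (D183-G184)")` returns positions 183 and 184.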
Preferred double, triple and multi-mutations - using SEQ ID
NO: 12 as the basis for the numbering - include:
Delta (D183-G184)+R181Q;
Delta (D183-G184)+G186R;
Delta (D183-G184)+N195F;
Delta (D183-G184)+M202L;
Delta (D183-G184)+G186R+N195F;
Delta (D183-G184)+N195F+R400T;
Delta (D183-G184)+N195F+W439R;
Delta (D183-G184)+N195F+Q449E;
Delta (D183-G184)+N195F+N484Q;
Delta (D183-G184)+N195F+K446N;
Delta (D183-G184)+N195F+N445Q+K446N+N484E;
Delta (D183-G184)+N195F+N445Q+K446N+Q449E;
Delta (D183-G184)+N195F+N471E;
Delta (D183-G184)+N195F+H324K;
Delta (D183-G184)+N195F+[...];
Delta (D183-G184)+N195F+K446D;
Delta (D183-G184)+N195F+R181Q;
Delta (D183-G184)+N195F+R181E;
Delta (D183-G184)+N195F+N445Q+K446N;
N445Q+K446N;
N445Q+K446N+N484E;
N445Q+K446N+Q449E;

Delta (D183-G184)+N195F+R181Q+N445Q+K446N;
Delta (D183-G184)+N195F+R181Q+K446N;
Delta (D183-G184)+N195F+R181Q+N445Q+K446N+N484E;
Delta (D183-G184)+N195F+R181E+N445Q+K446N;
Delta (D183-G184)+N195F+R181E+K446N;
Delta (D183-G184)+N195F+R181E+N445Q+K446N+N484E;
Delta (D183-G184)+N195F+N445Q+K446N+Y243F;
Delta (D183-G184)+N195F+N445Q+K446N+V209I;
Delta (D183-G184)+N195F+E212R;
Delta (D183-G184)+N195F+M116R;
Delta (D183-G184)+N195F+K142H+D144H+R158H;
Delta (D183-G184)+N195F+K142H+D144H;
Delta (D183-G184)+N195F+R158H;
Delta (D183-G184)+N195F+E345R;
Delta (D183-G184)+N195F+W189R;
Delta (D183-G184)+N195F+Y298H;
Delta (D183-G184)+N195F+N299A;
Delta (D183-G184)+N195F+K302R+S303Q;
Delta (D183-G184)+N195F+N306G;
Delta (D183-G184)+N195F+Y298H+N299A+K302R+S303Q+N306G;
Delta (D183-G184)+N195F+N125A;
Delta (D183-G184)+N195F+N125R;
Delta (D183-G184)+N195F+R181Q+N445Q+K446N+R310A;
Delta (D183-G184)+N195F+R181Q+N445Q+K446N+R310A+Q311N;
Delta (D183-G184)+N195F+R181Q+N445Q+K446N+R320K;
Delta (D183-G184)+N195F+R181Q+N445Q+K446N+Q319K+R320D;
Delta (D183-G184)+N195F+R181Q+N445Q+K446N+N306A;
Delta (D183-G184)+N195F+R181Q+N445Q+K446N+K302N;
Delta (D183-G184)+N195F+R181Q+N445Q+K446N+E345N;
Delta (D183-G184)+N195F+R181Q+N445Q+K446N+Y298F;



Delta (D183-G184) +N1 9 5 F1-F[TMQ+N4~T5 cr+K44"6N+R2Wr " 
Delta (D183-G184) +N195F+R181Q+N44 5Q+K44 6N+R2 8N+R310A; 
Delta (D18 3-G184) +N195F+R181Q+N44 5Q+K44 6N+N128D+N3 06D; 
Delta (D183-G184) +N19 5F+R181Q+N4 4 5Q+K44 6N+N12 8D; 
Delta (D18 3-G184) +N195F+R181Q+N4 4 5Q+K44 6N+N3 06D; 
N174D+N314D; 

Delta (D183-G184) +N195F+N306D+R310H+E345D; 
Delta (D183-G184) +R181Q+N195F+W189R+S94K; 
Delta {D18 3-G184) +R181Q+W18 9R+S94K; 
Delta (D183-G184) +N195F+R181Q-f S94K; 
Delta (D183-G184) +N195F+R181K+N125R; 
Delta (D183-G184) +N195F+R181K+N125K; 
Delta (D18 3 -G184) +N195F+R118K+R3 2 0K+R4 5 8K; 
Delta (D18 3-G184) +R181Q+N195F+R118K+R3 2 0K+R4 58K; 
Delta (D183-G184) +N195F+R181Q+N306G; 
Delta {D183-G184) +N195F+R181Q+W189R; 

Delta (D183-G184) +R181Q+N195F+R118K+N12 5K+R444K+N445K 
Delta (D183-G184) +R18 1Q+N195F+R118K+N12 5K+R32 0K+R4 58K; 
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Delta (D183-G184) +R181Q+N195F-hR118K+N12 5R+R3 2 0K+R4 58K; 
Delta (D183-G184) +R181Q+N195F+S94K+R118K+R320K+R458K; 
Delta (D183-G184) +R181Q+N195F+R118K+N306R-fR320K+R458K; 
Delta (D183-G184) +N195F+R118K+R320K+R458K+R444K+N445K; 
5 Delta (D183-G184) +N195F+S94K+R118K+N306R+R320K+R458K; 

Contemplated variants include the specific ones mentioned in
the examples and the above-mentioned mutations in further
combination with one or more of the mutations/substitutions:
Delta (D183-G184); N195F; R181Q; G186R; M202L; V206F,L,M;
N193S,T,P.
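
For readers who wish to handle the combined mutation notation used in the lists above programmatically, the following is a minimal sketch, not part of the invention as such, of how a string such as "Delta (D183-G184)+N195F" might be split into individual deletions and substitutions. The names `Substitution`, `Deletion` and `parse_variant` are hypothetical and introduced for illustration only; the sketch assumes one-letter amino acid codes and does not expand alternative lists such as "R181Q,E,K".

```python
import re
from typing import NamedTuple, Union


class Substitution(NamedTuple):
    original: str      # one-letter code of the parent residue
    position: int      # position in the reference numbering (e.g. SEQ ID NO: 12)
    replacement: str   # one-letter code of the introduced residue


class Deletion(NamedTuple):
    first: int         # first deleted position
    last: int          # last deleted position (same as first for one residue)


Mutation = Union[Substitution, Deletion]

# Patterns are applied after all spaces have been removed from the input.
_SUB = re.compile(r"^([A-Z])(\d+)([A-Z])$")                        # N195F
_DEL_RANGE = re.compile(r"^Delta\(([A-Z])(\d+)-([A-Z])(\d+)\)$")  # Delta(D183-G184)
_DEL_ONE = re.compile(r"^Delta([A-Z])(\d+)$")                      # DeltaG184


def parse_variant(text: str) -> list[Mutation]:
    """Split a '+'-joined variant description into its mutations."""
    mutations: list[Mutation] = []
    for token in text.replace(" ", "").rstrip(";").split("+"):
        if m := _DEL_RANGE.match(token):
            mutations.append(Deletion(int(m.group(2)), int(m.group(4))))
        elif m := _DEL_ONE.match(token):
            position = int(m.group(2))
            mutations.append(Deletion(position, position))
        elif m := _SUB.match(token):
            mutations.append(
                Substitution(m.group(1), int(m.group(2)), m.group(3))
            )
        else:
            raise ValueError(f"unrecognised mutation token: {token!r}")
    return mutations
```

A call such as `parse_variant("Delta (D183-G184)+N195F+R181Q;")` would yield one deletion spanning positions 183-184 followed by two substitutions; tokens outside the three assumed forms raise an error rather than being guessed at.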

Increased solubility

All the above-mentioned mutations result in Termamyl-like
alpha-amylase variants with altered solubility when used in any
alpha-amylase within the group of Termamyl-like alpha-amylases as
defined above.

In a preferred embodiment a variant of the invention with
increased solubility in comparison to the parent alpha-amylase
has one or more of the following substitutions:
Delta G184; Delta (R181-G182); Delta (D183-G184);
R28N,K; S94K; R118K; N125A,R,K; N174D; R181Q,E,K; G186R;
W189R,K; N195F; M202L; Y298H,F; N299A; K302R, S303Q,
N306G,D,R,K; R310A,K,Q,E,H,D,N; N314D; R320K; H324K;
E345R,D,K,N; Y396F; R400T,K; W439R; R444K; N445K,Q; K446N;
Q449E; R458K; N471E; N484Q.

In a more preferred embodiment, variants of the invention with
increased solubility (determined using one of the methods
described in the "Materials and Methods" section) - using SEQ ID
NO: 12 as the basis for the numbering - include:
Delta (D183-G184)+R181Q;
Delta (D183-G184)+G186R;
Delta (D183-G184)+N195F;
Delta (D183-G184)+M202L;
Delta (D183-G184)+G186R+N195F;
Delta (D183-G184)+N195F+R400T;
Delta (D183-G184)+N195F+W439R;
Delta (D183-G184)+N195F+Q449E;
Delta (D183-G184)+N195F+N484Q;
Delta (D183-G184)+N195F+K446N;
Delta (D183-G184)+N195F+N445Q+K446N+N484E;
Delta (D183-G184)+N195F+N445Q+K446N+Q449E;
Delta (D183-G184)+N195F+N471E;
Delta (D183-G184)+N195F+H324K;
Delta (D183-G184)+N195F+Y396F;
Delta (D183-G184)+N195F+K446D;
Delta (D183-G184)+N195F+R181Q;
Delta (D183-G184)+N195F+R181E;
Delta (D183-G184)+N195F+N445Q+K446N;
N445Q+K446N;
N445Q+K446N+N484E;
N445Q+K446N+Q449E;
Delta (D183-G184)+N195F+R181Q+N445Q+K446N;
Delta (D183-G184)+N195F+R181Q+K446N;
Delta (D183-G184)+N195F+R181Q+N445Q+K446N+N484E;
Delta (D183-G184)+N195F+R181E+N445Q+K446N;
Delta (D183-G184)+N195F+R181E+K446N;
Delta (D183-G184)+N195F+R181E+N445Q+K446N+N484E;
Delta (D183-G184)+N195F+N445Q+K446N+Y243F;
Delta (D183-G184)+N195F+N445Q+K446N+V209I;
Delta (D183-G184)+N195F+E212R;
Delta (D183-G184)+N195F+M116R;
Delta (D183-G184)+N195F+K142H+D144H+R158H;
Delta (D183-G184)+N195F+K142H+D144H;
Delta (D183-G184)+N195F+R158H;
Delta (D183-G184)+N195F+E345R;
Delta (D183-G184)+N195F+W189R;
Delta (D183-G184)+N195F+Y298H;
Delta (D183-G184)+N195F+N299A;
Delta (D183-G184)+N195F+K302R+S303Q;
Delta (D183-G184)+N195F+N306G;
Delta (D183-G184)+N195F+Y298H+N299A+K302R+S303Q+N306G;
Delta (D183-G184)+N195F+N125A;
Delta (D183-G184)+N195F+N125R;
Delta (D183-G184)+N195F+R181Q+N445Q+K446N+R310A;
Delta (D183-G184)+N195F+R181Q+N445Q+K446N+R310A+Q311N;
Delta (D183-G184)+N195F+R181Q+N445Q+K446N+R320K;
Delta (D183-G184)+N195F+R181Q+N445Q+K446N+Q319K+R320D;
Delta (D183-G184)+N195F+R181Q+N445Q+K446N+N306A;
Delta (D183-G184)+N195F+R181Q+N445Q+K446N+K302N;
Delta (D183-G184)+N195F+R181Q+N445Q+K446N+E345N;
Delta (D183-G184)+N195F+R181Q+N445Q+K446N+Y298F;
Delta (D183-G184)+N195F+R181Q+N445Q+K446N+R28N;
Delta (D183-G184)+N195F+R181Q+N445Q+K446N+R28N+R310A;
Delta (D183-G184)+N195F+R181Q+N445Q+K446N+N128D+N306D;
Delta (D183-G184)+N195F+R181Q+N445Q+K446N+N128D;
Delta (D183-G184)+N195F+R181Q+N445Q+K446N+N306D;
N174D+N314D;
Delta (D183-G184)+N195F+N306D+R310H+E345D;
Delta (D183-G184)+R181Q+N195F+W189R+S94K;
Delta (D183-G184)+R181Q+W189R+S94K;
Delta (D183-G184)+N195F+R181Q+S94K;
Delta (D183-G184)+N195F+R181K+N125R;
Delta (D183-G184)+N195F+R181K+N125K;
Delta (D183-G184)+N195F+R118K+R320K+R458K;
Delta (D183-G184)+R181Q+N195F+R118K+R320K+R458K;
Delta (D183-G184)+N195F+R181Q+N306G;
Delta (D183-G184)+N195F+R181Q+W189R;
Delta (D183-G184)+R181Q+N195F+R118K+N125K+R444K+N445K;
Delta (D183-G184)+R181Q+N195F+R118K+N125K+R320K+R458K;
Delta (D183-G184)+R181Q+N195F+R118K+N125R+R320K+R458K;
Delta (D183-G184)+R181Q+N195F+S94K+R118K+R320K+R458K;
Delta (D183-G184)+R181Q+N195F+R118K+N306R+R320K+R458K;
Delta (D183-G184)+N195F+R118K+R320K+R458K+R444K+N445K;
Delta (D183-G184)+N195F+S94K+R118K+N306R+R320K+R458K;

Other combinations according to the invention include the
specific ones mentioned in the examples and one or more of the
following substitutions: N195F; R181Q; G186R; M202L; V206F,L,M;
N193P.



Stability 

In the context of the present invention, mutations
(including amino acid substitutions) of importance with respect
to achieving altered stability (i.e., higher or lower), in
particular improved stability, especially at high pH (i.e.,
pH 8-10.5), include any of the mutations listed in the "Altered
properties" section.

Ca2+ stability

Altered Ca2+ stability means that the stability of the enzyme
under Ca2+ depletion has been changed, i.e., the stability is
higher or lower. In the context of the present invention, mutations
(including amino acid substitutions) of importance with respect
to achieving altered Ca2+ stability (i.e., higher or lower), in
particular improved Ca2+ stability, especially at high
pH (i.e., pH 8-10.5), include any of the mutations listed in the
"Altered properties" section.

Specific activity 

In a further aspect of the present invention, important
mutations with respect to obtaining variants exhibiting altered
specific activity, in particular increased or decreased specific
activity, especially at temperatures from 10-60°C, preferably
20-50°C, especially 30-40°C, include any of the mutations listed in
the "Altered properties" section.

General mutations in variants of the invention 

Particularly interesting amino acid substitutions are
those that increase the mobility around the active site of the
enzyme. This is accomplished by changes that disrupt stabilizing
interactions in the vicinity of the active site, i.e., within
preferably 10 Å or 8 Å or 6 Å or 4 Å from any of the residues
constituting the active site.

Examples are mutations that reduce the size of side chains,
such as

Ala to Gly,
Val to Ala or Gly,
Ile or Leu to Val, Ala, or Gly,
Thr to Ser.

Such mutations are expected to cause increased flexibility in
the active site region either by the introduction of cavities
or by the structural rearrangements that fill the space left by
the mutation.

It may be preferred that a variant of the invention comprises
one or more modifications in addition to those outlined above.
Thus, it may be advantageous that one or more Proline residues
present in the part of the alpha-amylase variant which is modified
is/are replaced with a non-Proline residue which may be any of
the possible, naturally occurring non-Proline residues, and
which preferably is an Alanine, Glycine, Serine, Threonine,
Valine or Leucine.

Analogously, it may be preferred that one or more Cysteine
residues present among the amino acid residues with which the
parent alpha-amylase is modified is/are replaced with a
non-Cysteine residue such as Serine, Alanine, Threonine, Glycine,
Valine or Leucine.

Furthermore, a variant of the invention may - either as the
only modification or in combination with any of the above
outlined modifications - be modified so that one or more Asp
and/or Glu present in an amino acid fragment corresponding to
the amino acid fragment 185-209 of SEQ ID NO: 10 is replaced by
an Asn and/or Gln, respectively. Also of interest is the
replacement, in the Termamyl-like alpha-amylase, of one or more
of the Lys residues present in an amino acid fragment
corresponding to the amino acid fragment 185-209 of SEQ ID NO:
10 by an Arg.

It will be understood that the present invention encompasses
variants incorporating two or more of the above outlined
modifications.
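
As a rough illustration of the substitution classes outlined above (reduction of side-chain size, replacement of Proline or Cysteine, and replacement of Asp/Glu by Asn/Gln or of Lys by Arg), the following sketch enumerates candidate substitutions for chosen positions of a sequence. The function name `candidate_substitutions`, the table names and the toy sequence are assumptions made for illustration only; which positions lie near the active site, or within fragment 185-209, must be supplied by the user from the structure or alignment.

```python
# Substitution tables transcribed from the text (one-letter codes).
SIZE_REDUCING = {
    "A": ["G"],
    "V": ["A", "G"],
    "I": ["V", "A", "G"],
    "L": ["V", "A", "G"],
    "T": ["S"],
}
PROLINE_REPLACEMENT = {"P": ["A", "G", "S", "T", "V", "L"]}
CYSTEINE_REPLACEMENT = {"C": ["S", "A", "T", "G", "V", "L"]}
CHARGE_REPLACEMENT = {"D": ["N"], "E": ["Q"], "K": ["R"]}


def candidate_substitutions(sequence, positions, table):
    """List labels such as 'V3A' for the given 1-based positions.

    Positions whose residue has no entry in `table` contribute nothing.
    """
    candidates = []
    for position in positions:
        residue = sequence[position - 1]
        for replacement in table.get(residue, []):
            candidates.append(f"{residue}{position}{replacement}")
    return candidates


if __name__ == "__main__":
    toy_sequence = "MAVLTPDK"  # hypothetical sequence, not an alpha-amylase
    print(candidate_substitutions(toy_sequence, [2, 3, 5], SIZE_REDUCING))
    print(candidate_substitutions(toy_sequence, [6], PROLINE_REPLACEMENT))
    print(candidate_substitutions(toy_sequence, [7, 8], CHARGE_REPLACEMENT))
```

The sketch only lists candidates; whether a given substitution in fact increases flexibility or alters a property remains to be determined experimentally.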

Furthermore, it may be advantageous to introduce mutations in
one or more of the following positions (using SEQ ID NO: 10
(Termamyl) for the numbering):
M15, V128, A111, H133, W138, T149, M197, N188, A209, A210, H405,
T412, in particular the following single, double, triple or
multi mutations:
M15X, in particular M15T,L;
V128X, in particular V128E;
H133X, in particular H133Y;
N188X, in particular N188S,T,P;
M197X, in particular M197T,L;
A209X, in particular A209V;
M197T/W138F; M197T/W138Y; M15T/H133Y/N188S;
M15/V128E/H133Y/N188S; E119C/S130C; D124C/R127C; H133Y/T149I;
G475R, H133Y/S187D; H133Y/A209V.

In AA560 this corresponds to:
L17X, in particular L17T;
E130X;
Y135X;
W140X, in particular W140F,Y;
T193X, in particular T193S,P;
M202X, in particular M202T,L;
V214X.

Contemplated combinations thereof include:
M202T/W140F; M202T/W140Y; L17T/T193S;
L17T/N193P; E121C/S132C; N126C/Q129C; T151I;
G480R.

Methods for preparing alpha-amylase variants of the invention

Several methods for introducing mutations into genes are
known in the art. After a brief discussion of the cloning of
alpha-amylase-encoding DNA sequences, methods for generating
mutations at specific sites within the alpha-amylase-encoding
sequence will be discussed.

Cloning a DNA sequence encoding an alpha-amylase

The DNA sequence encoding a parent alpha-amylase may be 
isolated from any cell or microorganism producing the alpha- 
amylase in question, using various methods well known in the 
art. First, a genomic DNA and/or cDNA library should be 
constructed using chromosomal DNA or messenger RNA from the 
organism that produces the alpha-amylase to be studied. Then, if 
the amino acid sequence of the alpha-amylase is known, 
homologous, labeled oligonucleotide probes may be synthesized 
and used to identify alpha-amylase-encoding clones from a 
genomic library prepared from the organism in question. Alter- 
natively, a labeled oligonucleotide probe containing sequences 
homologous to a known alpha-amylase gene could be used as a 
probe to identify alpha-amylase-encoding clones, using 
hybridization and washing conditions of lower stringency.

Yet another method for identifying alpha-amylase-encoding clones
would involve inserting fragments of genomic DNA into an
expression vector, such as a plasmid, transforming alpha-amylase-
negative bacteria with the resulting genomic DNA library, and
then plating the transformed bacteria onto agar containing a
substrate for alpha-amylase, thereby allowing clones expressing
the alpha-amylase to be identified.

Alternatively, the DNA sequence encoding the enzyme may be
prepared synthetically by established standard methods, e.g.,
the phosphoramidite method described by S.L. Beaucage and M.H.
Caruthers, Tetrahedron Letters 22, 1981, pp. 1859-1869 or the
method described by Matthes et al., The EMBO J. 3, 1984, pp.
801-805. In the phosphoramidite method, oligonucleotides are
synthesized, e.g., in an automatic DNA synthesizer, purified,
annealed, ligated and cloned in appropriate vectors.

Finally, the DNA sequence may be of mixed genomic and
synthetic origin, mixed synthetic and cDNA origin or mixed genomic
and cDNA origin, prepared by ligating fragments of synthetic,
genomic or cDNA origin (as appropriate, the fragments
corresponding to various parts of the entire DNA sequence), in
accordance with standard techniques. The DNA sequence may also
be prepared by polymerase chain reaction (PCR) using specific
primers, for instance as described in US 4,683,202 or R.K. Saiki
et al., Science 239, 1988, pp. 487-491.

Expression of alpha-amylase variants 

According to the invention, a DNA sequence encoding the 
variant produced by methods described above, or by any alterna- 
tive methods known in the art, can be expressed, in enzyme form, 
using an expression vector which typically includes control 
sequences encoding a promoter, operator, ribosome binding site, 
translation initiation signal, and, optionally, a repressor gene 
or various activator genes. 

The recombinant expression vector carrying the DNA sequence
encoding an alpha-amylase variant of the invention may be any
vector that may conveniently be subjected to recombinant DNA
procedures, and the choice of vector will often depend on the
host cell into which it is to be introduced. Thus, the vector
may be an autonomously replicating vector, i.e., a vector which
exists as an extrachromosomal entity, the replication of which
is independent of chromosomal replication, e.g., a plasmid, a
bacteriophage or an extrachromosomal element, minichromosome or
an artificial chromosome. Alternatively, the vector may be one
which, when introduced into a host cell, is integrated into the
host cell genome and replicated together with the chromosome(s)
into which it has been integrated.

In the vector, the DNA sequence should be operably connected
to a suitable promoter sequence. The promoter may be any DNA
sequence which shows transcriptional activity in the host cell
of choice and may be derived from genes encoding proteins either
homologous or heterologous to the host cell. Examples of
suitable promoters for directing the transcription of the DNA
sequence encoding an alpha-amylase variant of the invention,
especially in a bacterial host, are the promoter of the lac
operon of E. coli, the Streptomyces coelicolor agarase gene dagA
promoters, the promoters of the Bacillus licheniformis alpha-
amylase gene (amyL), the promoters of the Bacillus
stearothermophilus maltogenic amylase gene (amyM), the promoters
of the Bacillus amyloliquefaciens alpha-amylase (amyQ), the
promoters of the Bacillus subtilis xylA and xylB genes, etc. For
transcription in a fungal host, examples of useful promoters are
those derived from the gene encoding A. oryzae TAKA amylase,
Rhizomucor miehei aspartic proteinase, A. niger neutral alpha-
amylase, A. niger acid stable alpha-amylase, A. niger
glucoamylase, Rhizomucor miehei lipase, A. oryzae alkaline
protease, A. oryzae triose phosphate isomerase or A. nidulans
acetamidase.

The expression vector of the invention may also comprise a
suitable transcription terminator and, in eukaryotes,
polyadenylation sequences operably connected to the DNA sequence
encoding the alpha-amylase variant of the invention. Termination
and polyadenylation sequences may suitably be derived from the
same sources as the promoter.

The vector may further comprise a DNA sequence enabling the
vector to replicate in the host cell in question. Examples of
such sequences are the origins of replication of plasmids pUC19,
pACYC177, pUB110, pE194, pAMB1 and pIJ702.

The vector may also comprise a selectable marker, e.g., a gene
the product of which complements a defect in the host cell, such
as the dal genes from B. subtilis or B. licheniformis, or one
which confers antibiotic resistance such as ampicillin,
kanamycin, chloramphenicol or tetracycline resistance.
Furthermore, the vector may comprise Aspergillus selection markers
such as amdS, argB, niaD and sC, a marker giving rise to
hygromycin resistance, or the selection may be accomplished by
co-transformation, e.g., as described in WO 91/17243.

While intracellular expression may be advantageous in some
respects, e.g., when using certain bacteria as host cells, it is
generally preferred that the expression is extracellular. In
general, the Bacillus alpha-amylases mentioned herein comprise a
preregion permitting secretion of the expressed enzyme into
the culture medium. If desirable, this preregion may be replaced
by a different preregion or signal sequence, conveniently
accomplished by substitution of the DNA sequences encoding the
respective preregions.

The procedures used to ligate the DNA construct of the
invention encoding an alpha-amylase variant, the promoter, terminator
and other elements, respectively, and to insert them into
suitable vectors containing the information necessary for
replication, are well known to persons skilled in the art (cf.,
for instance, Sambrook et al., Molecular Cloning: A Laboratory
Manual, 2nd Ed., Cold Spring Harbor, 1989).

The cell of the invention, either comprising a DNA construct 
or an expression vector of the invention as defined above, is 
advantageously used as a host cell in the recombinant production 
of an alpha-amylase variant of the invention. The cell may be 
transformed with the DNA construct of the invention encoding the 
variant, conveniently by integrating the DNA construct (in one 
or more copies) in the host chromosome. This integration is 
generally considered to be an advantage as the DNA sequence is 
more likely to be stably maintained in the cell. Integration of 
the DNA constructs into the host chromosome may be performed 
according to conventional methods, e.g., by homologous or 
heterologous recombination. Alternatively, the cell may be
transformed with an expression vector as described above in
connection with the different types of host cells.

The cell of the invention may be a cell of a higher organism 
such as a mammal or an insect, but is preferably a microbial 
cell, e.g., a bacterial or a fungal (including yeast) cell. 

Examples of suitable bacteria are Gram-positive bacteria such
as Bacillus subtilis, Bacillus licheniformis, Bacillus lentus,
Bacillus brevis, Bacillus stearothermophilus, Bacillus
alkalophilus, Bacillus amyloliquefaciens, Bacillus coagulans, Bacillus
circulans, Bacillus lautus, Bacillus megaterium, Bacillus
thuringiensis, or Streptomyces lividans or Streptomyces murinus,
or Gram-negative bacteria such as E. coli. The transformation of
the bacteria may, for instance, be effected by protoplast
transformation or by using competent cells in a manner known per se.

The yeast organism may favorably be selected from a species
of Saccharomyces or Schizosaccharomyces, e.g., Saccharomyces
cerevisiae. The filamentous fungus may advantageously belong to
a species of Aspergillus, e.g., Aspergillus oryzae or Aspergillus
niger. Fungal cells may be transformed by a process involving
protoplast formation and transformation of the protoplasts
followed by regeneration of the cell wall in a manner known per
se. A suitable procedure for transformation of Aspergillus host
cells is described in EP 238 023.

In a yet further aspect, the present invention relates to a
method of producing an alpha-amylase variant of the invention,
which method comprises cultivating a host cell as described
above under conditions conducive to the production of the
variant and recovering the variant from the cells and/or culture
medium.

The medium used to cultivate the cells may be any conventional
medium suitable for growing the host cell in question and
obtaining expression of the alpha-amylase variant of the
invention. Suitable media are available from commercial suppliers or
may be prepared according to published recipes (e.g., as
described in catalogues of the American Type Culture
Collection).

The alpha-amylase variant secreted from the host cells may
conveniently be recovered from the culture medium by well-known
procedures, including separating the cells from the medium by
centrifugation or filtration, and precipitating proteinaceous
components of the medium by means of a salt such as ammonium
sulphate, followed by the use of chromatographic procedures such
as ion exchange chromatography, affinity chromatography, or the
like.

Industrial Applications 

The alpha-amylase variants of this invention possess valuable
properties allowing for a variety of industrial applications. In
particular, enzyme variants of the invention are applicable as a
component in washing, dishwashing and hard surface cleaning
detergent compositions. Variants of the invention with altered
properties may be used for starch processes, in particular
starch conversion, especially liquefaction of starch (see, e.g.,
US 3,912,590, EP patent publications Nos. 252 730 and 63 909,
and WO 99/19467).

Further, variants of the invention are also particularly
useful in the production of sweeteners and ethanol from starch,
and/or for textile desizing.



Composition 

In an aspect the invention relates to a composition comprising a 
variant of the invention. 


Detergent compositions 

As mentioned above, variants of the invention may suitably be
incorporated in detergent compositions. Reference is made, for
example, to WO 96/23874 and WO 97/07202 for further details
concerning relevant ingredients of detergent compositions (such
as laundry or dishwashing detergents), appropriate methods of
formulating the variants in such detergent compositions, and for
examples of relevant types of detergent compositions.

Detergent compositions comprising a variant of the invention
may additionally comprise one or more other enzymes, such as a
lipase, cutinase, protease, cellulase, peroxidase or laccase,
and/or another alpha-amylase.

Alpha-amylase variants of the invention may be incorporated
in detergents at conventionally employed concentrations. It is
at present contemplated that a variant of the invention may be
incorporated in an amount corresponding to 0.00001-10 mg
(calculated as pure, active enzyme protein) of alpha-amylase per
liter of wash/dishwash liquor using conventional dosing levels
of detergent.

The invention also relates to a method of providing alpha-
amylases with altered properties relative to the parent alpha-
amylase, in particular solubility, especially increased
solubility, substrate specificity, substrate binding, substrate
cleavage pattern, temperature stability, pH dependence of
enzymatic activity, pH dependence of stability, stability
towards oxidation, Ca2+ dependency and specific activity,
comprising the following steps:

(a) modelling the parent alpha-amylase on the three-
dimensional structure of SEQ ID NO: 4 depicted in Appendix 1
to produce a three-dimensional structure of the parent alpha-
amylase;

(b) identifying in the three-dimensional structure obtained in
step (a) at least one structural part of the parent, wherein an
alteration in said structural part is predicted to result in
said altered property;

(c) modifying the sequence of a nucleic acid encoding the parent 
alpha-amylase to produce a nucleic acid encoding a deletion, 
insertion, or substitution of one or more amino acids at a 
position corresponding to said structural part; and 

(d) expressing the modified nucleic acid in a host cell to 
produce the variant alpha-amylase, 

wherein the variant has alpha-amylase enzymatic activity and has 
at least one altered property relative to the parent. 
In an embodiment the method is a method of constructing a
variant of a parent alpha-amylase having an altered property
relative to the parent, wherein the parent alpha-amylase has the
sequence of SEQ ID NOS: 2, 4, 6, 8, 10, 12 or 13 or has a
sequence having at least 60% identity (determined as described
above) to said sequence, said method comprising:

(a) modelling the parent alpha-amylase on the three-dimensional
structure of SEQ ID NO: 4 depicted in Appendix 1 to produce a
three-dimensional structure of the parent alpha-amylase;

(b) comparing the three-dimensional structure obtained in step
(a) with a three-dimensional structure of an unrelated alpha-
amylase, wherein the unrelated alpha-amylase differs from the
parent alpha-amylase in said property;

(c) identifying a structural part of the three-dimensional
structure obtained in step (a) which is different from the
three-dimensional structure of the unrelated alpha-amylase and
which is predicted to be relevant to said property;

(d) modifying the sequence of a nucleic acid encoding the parent
alpha-amylase to produce a nucleic acid encoding a deletion,
insertion, or substitution of one or more amino acids at a
position corresponding to said structural part; and

(e) expressing the modified nucleic acid in a host cell to
produce the variant alpha-amylase,

wherein the variant has alpha-amylase activity and has one or
more altered properties as compared to the parent alpha-amylase.
The Termamyl-like alpha-amylases contemplated are the ones
described above.



Method of increasing the solubility of polypeptides 

The present invention also relates to a method of increasing
the solubility of polypeptides, such as enzymes, in particular
Termamyl-like alpha-amylases.

The method of this invention comprises substitution,
insertion and/or deletion of one or more amino acid
residue(s), which residue(s) hold a position close to a
neighbour polypeptide molecule located in the protein crystal.

In the context of this invention, a polypeptide amino acid
residue holding a position close to the neighbour molecule
indicates an amino acid residue located within the polypeptide
in such a way that it is within a potential intermolecular
interactive distance from an amino acid residue located at a
neighbour enzyme molecule in the crystal or precipitate.

Examples of potential intermolecular interactions include,
but are not limited to, hydrogen bonding, salt bridge formation,
polar interactions, hydrophobic interactions and aromatic
interactions.

In the context of the present invention, the term "neighbouring"
refers to the shortest distance between two positions in a
crystal structure, the crystal being prepared, e.g., as described in
"Protein Crystallization Techniques, Strategies, and Tips", A
Laboratory Manual by Terese M. Bergfors, or as described on the
Internet on the site: www.hamptonresearch.com.

Thus, in this aspect the invention relates to a method of
increasing the solubility of enzyme crystals, wherein one or
more amino acid residue(s),

1) located within a distance of 6.0 Å of a neighbouring
polypeptide molecule in the tertiary crystal structure, and

2) interacting with said neighbouring polypeptide molecule,

are mutated, e.g., by substitution or deletion.

In a preferred embodiment the amino acid residues to be
mutated are located within 3.5 Å of a neighbouring polypeptide.
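
The distance criterion above can be illustrated by the following minimal sketch, which is offered under stated assumptions and is not the procedure actually used to derive the residue lists herein. It assumes that atomic coordinates (in Å) of one molecule and of its crystal neighbour have already been extracted into plain dictionaries keyed by residue number; the function name `residues_near_neighbour` and the toy coordinates are hypothetical. Only criterion 1) (distance) is covered; whether a residue in fact interacts with the neighbour, criterion 2), is not assessed.

```python
import math


def residues_near_neighbour(molecule, neighbour, cutoff=6.0):
    """Return residue numbers of `molecule` having at least one atom
    closer than `cutoff` (in angstrom) to any atom of `neighbour`.

    Both arguments map a residue number to a list of (x, y, z) tuples.
    """
    neighbour_atoms = [atom for atoms in neighbour.values() for atom in atoms]
    close = []
    for residue_number, atoms in molecule.items():
        if any(
            math.dist(atom, other) < cutoff
            for atom in atoms
            for other in neighbour_atoms
        ):
            close.append(residue_number)
    return sorted(close)


if __name__ == "__main__":
    # Toy coordinates for illustration only, not taken from any structure.
    molecule = {10: [(0.0, 0.0, 0.0)], 11: [(0.0, 0.0, 10.0)]}
    neighbour = {200: [(0.0, 0.0, 4.0)]}
    print(residues_near_neighbour(molecule, neighbour, cutoff=6.0))
    print(residues_near_neighbour(molecule, neighbour, cutoff=3.5))
```

The all-against-all comparison is chosen for clarity; for a full crystal structure a spatial index would normally be preferred, and the symmetry-related neighbour coordinates must be generated beforehand.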

Contemplated polypeptides include antimicrobial
polypeptides, polypeptides with biological activity, such as
insulin, growth hormone, EPO, TPO, Factor VII, Factor VIII.

Contemplated enzyme activities include protease, amylase,
CGTase, carbohydrase, transferase, lyase, oxidoreductase,
lipase, cellulase, cutinase, pectate lyase, mannanase,
maltogenic amylase, glucoamylase, pectin lyase activity.

In a preferred embodiment the enzyme is an amylase,
preferably an alpha-amylase, especially a Termamyl-like alpha-
amylase as will be described further below.

An amino acid position "close" to the substrate indicates a
distance less than 6 Å (angstrom) corresponding to a protein-
protein interaction mediated by a single water molecule. In a
preferred embodiment an amino acid position close to the
substrate indicates a distance less than 3.5 Å (angstrom)
corresponding to a direct protein-protein interaction.

Method of increasing the solubility of Termamyl-like alpha-
amylases

In this aspect the invention relates to a method of
increasing the solubility of Termamyl-like alpha-amylase
crystals, wherein one or more amino acid residue(s),

1) located within a distance of 6.0 Å of a neighbouring
Termamyl-like alpha-amylase molecule in the tertiary crystal
structure, and

2) interacting with said neighbouring Termamyl-like alpha-
amylase molecule,

are mutated, e.g., by substitution or deletion.

In a preferred embodiment the amino acid residues to be mutated
are located within 3.5 Å of a neighbouring Termamyl-like alpha-
amylase. The crystal structure of SP722 depicted in APPENDIX 1
may be used as the reference (fix-point) for determining the
distance between neighbouring alpha-amylases.

However, it is within the scope of the invention that the
reference (fix-point) crystal structure is a structure modelled
(as described above or in WO 96/23874) from SP722 depicted in
APPENDIX 1.

Preferred Termamyl-like alpha-amylases include Bacillus
alpha-amylases selected from the group consisting of alpha-
amylases derived from a strain of B. licheniformis, B.
amyloliquefaciens, B. stearothermophilus, Bacillus sp. NCIB 12289, NCIB
12512, NCIB 12513 or DSM 9375, or DSMZ no. 12649, KSM AP1378.
The Termamyl-like alpha-amylase may be any of the alpha-amylases
selected from the group depicted in SEQ ID NOS: 2, 4, 6, 8, 10,
12, and 13, or a Termamyl-like alpha-amylase having an amino acid
sequence which has a degree of identity to SEQ ID NO: 4 of at
least 60%, preferably 70%, more preferably at least 80%, even
more preferably at least about 90%, even more preferably at
least 95%, even more preferably at least 97%, and even more
preferably at least 99%, or a Termamyl-like alpha-amylase encoded
by a nucleic acid sequence which hybridizes under low,
preferably medium, more preferably high stringency conditions, with
the nucleic acid sequence of SEQ ID NO: 11. The conditions are
described in further detail above.

Termamyl-like alpha-amylase amino acid residues being less
than 6.0 Å from the nearest neighbour amylase molecule are the
following (using the SP722 numbering):

19, 20, 21, 22, 25, 28, 29, 53, 76, 84, 87, 90, 93, 94, 124,
125, 126, 128, 142, 144, 156, 157, 158, 159, 160, 161, 170, 171,
172, 173, 174, 175, 183, 184, 185, 186, 187, 188, 189, 190, 193,
195, 196, 197, 209, 212, 226, 229, 256, 257, 258, 259, 280, 281,
298, 299, 300, 302, 303, 304, 305, 306, 310, 311, 314, 319, 320,
321, 322, 341, 345, 405, 406, 408, 444, 447, 448, 449, 463, 464,
465, 466, 467.

Termamyl-like alpha-amylase amino acid residues being less
than 3.5 Å from the nearest neighbour amylase molecule are the
following (using the SP722 numbering):

22, 25, 28, 76, 94, 125, 128, 158, 160, 171, 173, 174, 184,
189, 209, 226, 229, 298, 299, 302, 306, 310, 314, 320, 345,
405, 447, 466.

The amino acid residues being less than 6.0 Å from the
nearest neighboring amylase molecule are the following
positions identified in the model structure of the SP722
amylase:

ASN 19, ASP 20, GLY 21, GLN 22, ASN 25, ARG 28, ASP 29, GLN 53,
ARG 76, GLN 84, SER 87, HIS 90, LYS 93, ASN 94, ASN 125, ASN
126, ASN 128, LYS 142, ASP 144, LYS 156, TRP 157, ARG 158, TRP
159, TYR 160, HIS 161, PHE 173, GLN 174, ASN 175, ASP 183, GLY
184, LYS 185, ALA 186, TRP 187, ASP 188, TRP 189, GLU 190, SER
193, GLY 196, ASN 197, ASP 209, GLU 212, ASN 226, ASN 229, ALA
256, THR 257, GLY 258, LYS 259, ASN 280, LYS 281, TYR 298, ASN
299, ALA 300, ASN 302, SER 303, GLY 304, GLY 305, ASN 306, ALA
310, ASN 314, GLN 319, LYS 320, HIS 321, PRO 322, GLU 341, GLN
345, PHE 405, ASP 406, HIS 408, GLN 444, ALA 447, GLY 448, GLN
449, THR 463, ILE 464, ASN 465, ALA 466, ASP 467.

The amino acid residues being less than 3.5 Å from the
nearest neighboring amylase molecule are the following
positions identified in the model structure of the SP722
amylase:

GLN 22, ASN 25, ARG 28, ARG 76, ASN 94, ASN 125, ASN 128, ARG
158, TYR 160, PHE 173, GLN 174, GLY 184, TRP 189, ASP 209, ASN
226, ASN 229, TYR 298, ASN 299, ASN 302, ASN 306, ASN 314, LYS
320, GLN 345, PHE 405, ALA 447, ALA 466.

The amino acid residues being less than 6.0 Å from the
nearest neighboring amylase molecule are the following
positions identified in the model structure of the AA560
amylase:

ASN 19, ASP 20, ASN 22, ASN 25, ARG 28, SER 29, GLN 53, ARG 76,
GLN 84, ALA 87, ASN 90, LYS 93, SER 94, PRO 124, ASN 125, ASN
126, ASN 128, LYS 142, ASP 144, LYS 156, TRP 157, ARG 158, TRP
159, TYR 160, HIS 161, SER 170, ARG 171, LYS 172, LEU 173, ASN
174, ASN 175, ASP 183, GLY 184, LYS 185, GLY 186, TRP 187, ASP
188, TRP 189, GLU 190, THR 193, ASN 195, GLY 196, ASN 197, ASP
209, GLU 212, ASN 226, GLY 229, ALA 256, THR 257, GLY 258, LYS
259, LYS 281, TYR 298, ASN 299, ALA 300, LYS 302, SER 303, GLY
304, ASN 306, ARG 310, GLN 311, ASN 314, GLN 319, ARG 320, HIS
321, PRO 322, GLU 341, GLU 345, LEU 405, HIS 408, ARG 444, ALA
447, GLY 448, GLN 449, THR 463, ILE 464, ASN 465, ALA 466, ASP
467.

The amino acid residues being less than 3.5 Å from the
nearest neighboring amylase molecule are the following
positions identified in the model structure of the AA560
amylase: ARG 28, ASN 125, ASN 128, ARG 158, TYR 160, ARG 171,
LEU 173, ASN 174, GLY 184, TRP 189, GLY 196, ASP 209, ASN 226,
TYR 298, ASN 299, LYS 302, ASN 306, ARG 310, ASN 314, ARG 320,
GLU 345, ALA 447, ALA 466.

Substitutions in alpha-amylases leading to higher solubility
and thus to higher performance.

In general, any substitution disturbing or destroying a
protein-protein interaction of any kind will lead to higher
solubility and thus to higher performance in a particular
application. An amino acid residue participating in a
protein-protein interaction may thus be changed to a larger, a
smaller, a more hydrophobic, a more hydrophilic, a charged or an
uncharged amino acid, leading to higher solubility.

A larger amino acid residue is capable of creating a steric
hindrance for the interaction, and a smaller amino acid residue
makes the distance between the interacting residues too large,
weakening the interaction. Polar interactions, and in particular
salt bridges, are destroyed by changing at least one of the
involved residues to a hydrophobic residue, by changing at
least one of the involved residues to a smaller residue making
the distance too large for a strong interaction, or by changing
at least one of the involved residues to a larger residue which
creates steric hindrance. Hydrophobic interactions are
effectively disturbed by changing one of the interacting amino
acids to a hydrophilic amino acid residue, but smaller or
larger hydrophobic residues can also destroy this kind of
interaction. Aromatic interactions can be prevented by changing
to a hydrophilic residue or to a small hydrophobic residue.

Substitutions in alpha-amylases leading to higher solubility
and/or higher performance:

22: X-Q,E;
25: X-N,D;
28: X-R,K,Q,H;
76: X-R,K,Q,H;
94: X-N,D;
125: X-N,D;
128: X-N,D;
158: X-R,K,Q,E,H;
160: X-Y,H,F,W;
171: backbone;
173: X-F,L,V,A,I;
174: X-N,D, DELETION;
184: backbone;
189: X-W,E,Q;
209: X-D,N;
226: X-D,N;
229: X-D,N;
298: X-Y,F,L,I,H;
299: X-D,N;
302: X-K,R,Q,E,H;
306: X-D,N;
310: X-R;
314: X-D,N;
320: backbone;
345: X-E,Q;
405: backbone;
447: backbone;
466: backbone.

All of the above-mentioned mutations are also contemplated
in the context of the variants of the invention.

MATERIALS AND METHODS 

Enzymes:

SP722: SEQ ID NO: 4, available from Novozymes, and disclosed in
WO 95/26397.

AA560: SEQ ID NO: 12; disclosed in WO 00/60060 and available
from Novozymes A/S; disclosed in Danish patent application no.
PA 1999 00490; deposited on 25th January 1999 at DSMZ and
assigned the DSMZ no. 12649.

AA560 was deposited by the inventors under the terms of the
Budapest Treaty on the International Recognition of the Deposit
of Microorganisms for the Purposes of Patent Procedure at
Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH
(DSMZ), Mascheroder Weg 1b, D-38124 Braunschweig, DE.



Bacillus subtilis SHA273: see WO 95/10603

Plasmids

pJE1 contains the gene encoding a variant of SP722
alpha-amylase (SEQ ID NO: 4), viz. a deletion of 6 nucleotides
corresponding to amino acids D183-G184 in the mature protein.
Transcription of the JE1 gene is directed from the amyL
promoter. The plasmid furthermore contains the origin of
replication and the cat-gene conferring resistance towards
kanamycin obtained from plasmid pUB110 (Gryczan, TJ et al.
(1978), J. Bact. 134:318-329).

Methods:

Construction of library vector pDorK101

The E. coli/Bacillus shuttle vector pDorK101 (described
below) can be used to introduce mutations without expression of
alpha-amylase in E. coli and then be modified in such a way that
the alpha-amylase is active in Bacillus. The vector was
constructed as follows: The JE1 encoding gene (SP722 with the
deletion of D183-G184) was inactivated in pJE1 by gene
interruption in the PstI site in the 5' coding region of SEQ
ID NO: 4 (SP722) by a 1.2 kb fragment containing an E. coli
origin of replication. This fragment was PCR amplified from
pUC19 (GenBank Accession #: X02514) using the forward primer
5'-gacctgcagtcaggcaacta-3' and the reverse primer
5'-tagagtcgacctgcaggcat-3'. The PCR amplicon and the pJE1 vector
were digested with PstI at 37°C for 2 hours. The pJE1 vector
fragment and the PCR fragment were ligated at room temperature
for 1 hour and transformed into E. coli by electrotransformation.
The resulting vector is designated pDorK101.

Filter screening assays 

The assay can be used to screen for Termamyl-like
alpha-amylase variants having an improved stability at high pH
compared to the parent enzyme, and for Termamyl-like
alpha-amylase variants having an improved stability at high pH
and medium temperatures compared to the parent enzyme, depending
on the screening temperature setting.

High pH filter assay 

Bacillus libraries are plated on a sandwich of cellulose
acetate (OE 67, Schleicher & Schuell, Dassel, Germany) and
nitrocellulose filters (Protran-Ba 85, Schleicher & Schuell,
Dassel, Germany) on TY agar plates with 10 µg/ml kanamycin at
37°C for at least 21 hours. The cellulose acetate layer is
located on the TY agar plate.

Each filter sandwich is specifically marked with a needle
after plating, but before incubation, in order to be able to
localize positive variants on the filter, and the nitrocellulose
filter with bound variants is transferred to a container with
glycin-NaOH buffer, pH 8.6-10.6, and incubated at room
temperature (can be altered from 10°-60°C) for 15 min. The
cellulose acetate filters with colonies are stored on the
TY-plates at room temperature until use. After incubation,
residual activity is detected on plates containing 1% agarose,
0.2% starch in glycin-NaOH buffer, pH 8.6-10.6. The assay
plates with nitrocellulose filters are marked the same way as
the filter sandwich and incubated for 2 hours at room
temperature. After removal of the filters the assay plates are
stained with 10% Lugol solution. Starch degrading variants are
detected as white spots on a dark blue background and then
identified on the storage plates. Positive variants are
rescreened twice under the same conditions as the first screen.



Low calcium filter assay 

The Bacillus library is plated on a sandwich of cellulose
acetate (OE 67, Schleicher & Schuell, Dassel, Germany) and
nitrocellulose filters (Protran-Ba 85, Schleicher & Schuell,
Dassel, Germany) on TY agar plates with a relevant antibiotic,
e.g., kanamycin or chloramphenicol, at 37°C for at least 21
hours. The cellulose acetate layer is located on the TY agar
plate.

Each filter sandwich is specifically marked with a needle
after plating, but before incubation, in order to be able to
localize positive variants on the filter, and the nitrocellulose
filter with bound variants is transferred to a container with
carbonate/bicarbonate buffer pH 8.5-10 and with different EDTA
concentrations (0.001 mM - 100 mM). The filters are incubated
at room temperature for 1 hour. The cellulose acetate filters
with colonies are stored on the TY-plates at room temperature
until use. After incubation, residual activity is detected on
plates containing 1% agarose, 0.2% starch in
carbonate/bicarbonate buffer pH 8.5-10. The assay plates with
nitrocellulose filters are marked the same way as the filter
sandwich and incubated for 2 hours at room temperature. After
removal of the filters the assay plates are stained with 10%
Lugol solution. Starch degrading variants are detected as white
spots on a dark blue background and then identified on the
storage plates. Positive variants are rescreened twice under
the same conditions as the first screen.



Method for obtaining the regions of interest:

There are three known 3D structures of bacterial
alpha-amylases. Two are of B. licheniformis alpha-amylase,
Brookhaven database 1BPL (Machius et al. (1995), J. Mol. Biol.
246, p. 545-559) and 1VJS (Song et al. (1996), Enzymes for
Carbohydrate 163 Engineering (Prog. Biotechnol. V 12)). These
two structures are lacking an important piece of the structure
from the so-called B-domain, in the area around the two calcium
ion and one sodium ion binding sites. We have therefore used a
3D structure of an alpha-amylase BA2 (WO 96/23874), which is a
hybrid between BAN™ (SEQ ID NO. 5) and B. licheniformis
alpha-amylase (SEQ ID NO. 4). On the basis of this structure,
models of B. licheniformis alpha-amylase and the SP722
alpha-amylase have been built.

Fermentation and purification of alpha-amylase variants

Fermentation and purification may be performed by methods 
well known in the art. 

Stability determination

All stability trials are made using the same set up. The
method is as follows:

The enzyme is incubated under the relevant conditions (1-4).
Samples are taken at various time points, e.g., after 0, 5, 10,
15 and 30 minutes, and diluted 25 times (same dilution for all
samples taken) in assay buffer (0.1 M 50 mM Britton buffer pH
7.3), and the activity is measured using the Phadebas assay
(Pharmacia) under standard conditions, pH 7.3, 37°C.

The activity measured before incubation (0 minutes) is used
as reference (100%). The decline in percent is calculated as a
function of the incubation time. The table shows the residual
activity after, e.g., 30 minutes of incubation.
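
The residual-activity calculation described above can be sketched as follows. This is a minimal illustration only: the function name and the activity readings are hypothetical and are not taken from the specification.

```python
def residual_activity(activities):
    """Express each measured activity as a percentage of the 0-minute value.

    `activities` maps incubation time in minutes to the measured activity
    (any consistent unit); it must contain the 0-minute reference.
    """
    reference = activities[0]
    if reference <= 0:
        raise ValueError("the 0-minute reference activity must be positive")
    return {t: 100.0 * a / reference for t, a in sorted(activities.items())}


# Hypothetical readings at the sampling times named in the text.
readings = {0: 0.80, 5: 0.72, 10: 0.64, 15: 0.56, 30: 0.40}
result = residual_activity(readings)
for minutes, percent in result.items():
    print(f"{minutes:>2} min: {percent:.0f}% residual activity")
```

Because all samples receive the same 25-fold dilution, the dilution factor cancels out of the ratio and does not need to enter the calculation.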

Specific activity determination 

The specific activity is determined using the Phadebas
assay (Pharmacia) as activity/mg enzyme. The manufacturer's
instructions are followed (see also below under "Assays for
Alpha-Amylase Activity").

Solubility Determination I

Purified enzyme is dialyzed against a selected buffer
overnight. Standard conditions are 0.02 M Tris-HCl, 0.15 M
NaCl, pH 7.5, room temperature. A preparation containing
approximately 40 mg enzyme is concentrated on an Amicon cell
using Millipore ultrafiltration membranes, YM10, NMWL: 10,000.
The concentration of the enzyme is followed by measuring A280
before and during the process. The concentration step is stopped
when precipitation starts to occur. Measurement of activity
in the supernatant is made to ensure that the precipitate
actually is amylase. The precipitate is then dissolved again by
adding buffer at room temperature and the protein concentration
is measured.

Solubility Determination II 

Purified enzyme is dialyzed against a selected buffer
overnight. Standard conditions are 0.01 M boric acid, 0.01 M
KCl, 0.002 M CaCl2, 0.15 M NaCl, pH 7.5. A preparation containing
approximately 40 mg enzyme is concentrated on an Amicon cell
using Millipore ultrafiltration membranes, YM10, NMWL: 10,000.
The concentration of the enzyme is followed by measuring A280
before and during the process. The concentration step is stopped
when precipitation starts to occur. Measurement of activity
in the supernatant is made to ensure that the precipitate
actually is amylase. The precipitate is then dissolved again by
adding buffer at room temperature and the protein concentration
is measured.



Assays for Alpha-Amylase Activity

1. Phadebas assay

Alpha-amylase activity is determined by a method employing
Phadebas® tablets as substrate. Phadebas tablets (Phadebas®
Amylase Test, supplied by Pharmacia Diagnostic) contain a
cross-linked insoluble blue-colored starch polymer, which has
been mixed with bovine serum albumin and a buffer substance and
tabletted.

For every single measurement one tablet is suspended in a
tube containing 5 ml 50 mM Britton-Robinson buffer (50 mM acetic
acid, 50 mM phosphoric acid, 50 mM boric acid, 0.1 mM CaCl2, pH
adjusted to the value of interest with NaOH). The test is
performed in a water bath at the temperature of interest. The
alpha-amylase to be tested is diluted in x ml of 50 mM
Britton-Robinson buffer. 1 ml of this alpha-amylase solution is
added to the 5 ml 50 mM Britton-Robinson buffer. The starch is
hydrolyzed by the alpha-amylase giving soluble blue fragments.
The absorbance of the resulting blue solution, measured
spectrophotometrically at 620 nm, is a function of the
alpha-amylase activity.

It is important that the measured 620 nm absorbance after 10
or 15 minutes of incubation (testing time) is in the range of
0.2 to 2.0 absorbance units at 620 nm. In this absorbance range
there is linearity between activity and absorbance (Lambert-Beer
law). The dilution of the enzyme must therefore be adjusted to
fit this criterion. Under a specified set of conditions (temp.,
pH, reaction time, buffer conditions) 1 mg of a given
alpha-amylase will hydrolyze a certain amount of substrate and a
blue colour will be produced. The colour intensity is measured
at 620 nm. The measured absorbance is directly proportional to
the specific activity (activity/mg of pure alpha-amylase
protein) of the alpha-amylase in question under the given set of
conditions.
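
The range criterion for the Phadebas reading can be illustrated with a short check. This is a sketch under stated assumptions: the helper names, the target absorbance of 1.0 and the example numbers are hypothetical, and the re-dilution estimate assumes absorbance scales inversely with the dilution factor, which only holds approximately outside the linear range.

```python
LINEAR_RANGE = (0.2, 2.0)  # absorbance units at 620 nm, as stated in the text


def in_linear_range(a620):
    """Return True if the reading lies within the linear absorbance range."""
    low, high = LINEAR_RANGE
    return low <= a620 <= high


def suggest_dilution(current_dilution, a620, target=1.0):
    """Estimate a dilution factor that would move the reading towards `target`.

    Assumption (not from the source): absorbance is inversely proportional
    to the dilution factor, so this is only a starting point for a re-run.
    """
    if a620 <= 0:
        raise ValueError("absorbance must be positive")
    return current_dilution * a620 / target


reading = 3.0      # hypothetical reading, above the linear range
dilution = 100.0   # hypothetical dilution factor used for that reading
if not in_linear_range(reading):
    print("re-run at roughly", suggest_dilution(dilution, reading), "fold")
```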
2. Alternative method 

Alpha-amylase activity is determined by a method employing
the PNP-G7 substrate. PNP-G7, which is an abbreviation for
p-nitrophenyl-alpha,D-maltoheptaoside, is a blocked
oligosaccharide which can be cleaved by an endo-amylase.
Following the cleavage, the alpha-glucosidase included in the
kit digests the substrate to liberate a free PNP molecule which
has a yellow colour and thus can be measured by visible
spectrophotometry at λ = 405 nm (400-420 nm). Kits containing
PNP-G7 substrate and alpha-glucosidase are manufactured by
Boehringer-Mannheim (cat. No. 1054635).

To prepare the substrate one bottle of substrate (BM
1442309) is added to 5 ml buffer (BM1442309). To prepare the
alpha-glucosidase one bottle of alpha-glucosidase (BM 1462309)
is added to 45 ml buffer (BM1442309). The working solution is
made by mixing 5 ml alpha-glucosidase solution with 0.5 ml
substrate.

The assay is performed by transferring 20 µl enzyme solution
to a 96 well microtitre plate and incubating at 25°C. 200 µl
working solution, 25°C, is added. The solution is mixed and
pre-incubated 1 minute and absorption is measured every 15 sec.
over 3 minutes at OD 405 nm.

The slope of the time dependent absorption curve is directly
proportional to the specific activity (activity per mg enzyme)
of the alpha-amylase in question under the given set of
conditions.
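
The slope of the absorption curve can be estimated by an ordinary least-squares fit over the readings. The sketch below is illustrative only: the function name and the synthetic readings are hypothetical, and a least-squares line is one reasonable choice that the text itself does not prescribe.

```python
def slope_per_minute(times_s, absorbances):
    """Least-squares slope of absorbance versus time, in absorbance units/min."""
    n = len(times_s)
    if n != len(absorbances) or n < 2:
        raise ValueError("need at least two matching time/absorbance pairs")
    mean_t = sum(times_s) / n
    mean_a = sum(absorbances) / n
    sxx = sum((t - mean_t) ** 2 for t in times_s)
    sxy = sum((t - mean_t) * (a - mean_a) for t, a in zip(times_s, absorbances))
    return 60.0 * sxy / sxx  # convert per-second slope to per-minute


# One reading every 15 s over 3 minutes, as in the assay description.
times = list(range(0, 181, 15))
# Hypothetical, perfectly linear OD405 readings (0.002 absorbance units/s).
readings = [0.100 + 0.002 * t for t in times]
print(f"slope: {slope_per_minute(times, readings):.3f} per minute")
```

Dividing the slope by the amount of enzyme in the well (in mg) would then give a value proportional to the specific activity.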


General method for random mutagenesis by use of the DOPE
program

The random mutagenesis may be carried out as follows:

1. Select regions of interest for modification in the parent
enzyme.

2. Decide on mutation sites and non-mutated sites in the
selected region.

3. Decide on which kind of mutations should be carried out,
e.g. with respect to the desired stability and/or performance
of the variant to be constructed.

4. Select structurally reasonable mutations.

5. Adjust the residues selected by step 3 with regard to step
4.

6. Analyze the nucleotide distribution by use of a suitable
dope algorithm.

7. If necessary, adjust the wanted residues to genetic code
realism, e.g., taking into account constraints resulting from
the genetic code (e.g. in order to avoid introduction of stop
codons); the skilled person will be aware that some codon
combinations cannot be used in practice and will need to be
adapted.

8. Make primers.

9. Perform random mutagenesis by use of the primers.

10. Select resulting alpha-amylase variants by screening for
the desired improved properties.

Suitable dope algorithms for use in step 6 are well known
in the art. One algorithm is described by Tomandl, D. et al.,
Journal of Computer-Aided Molecular Design, 11 (1997), pp.
29-38. Another algorithm, DOPE, is described in the following:

The dope program

The "DOPE" program is a computer algorithm useful to
optimize the nucleotide composition of a codon triplet in such
a way that it encodes an amino acid distribution which
resembles most the wanted amino acid distribution. In order to
assess which of the possible distributions is the most similar
to the wanted amino acid distribution, a scoring function is
needed. In the "Dope" program the following function was found
to be suited:

\[
s \equiv \prod_{i=1}^{N} \left( \frac{x_i^{y_i}\,(1-x_i)^{1-y_i}}{y_i^{y_i}\,(1-y_i)^{1-y_i}} \right)^{w_i}
\]

where the x_i's are the obtained amounts of amino acids and
groups of amino acids as calculated by the program, the y_i's
are the wanted amounts of amino acids and groups of amino acids
as defined by the user of the program (e.g. specify which of the
20 amino acids or stop codons are wanted to be introduced, e.g.
with a certain percentage (e.g. 90% Ala, 3% Ile, 7% Val)), and
the w_i's are assigned weight factors as defined by the user of
the program (e.g., depending on the importance of having a
specific amino acid residue inserted into the position in
question). N is 21 plus the number of amino acid groups as
defined by the user of the program. For purposes of this
function 0^0 is defined as being 1.
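
A scoring function of this kind can be sketched in a few lines. The sketch assumes the product form s = prod_i ((x_i^y_i (1-x_i)^(1-y_i)) / (y_i^y_i (1-y_i)^(1-y_i)))^w_i over the 20 amino acids plus stop (N = 21, no user-defined groups); that form is an assumption of this example, as the printed formula is not legible in the scanned text. The function and variable names are hypothetical.

```python
# 20 amino acids (one-letter codes) plus "*" for stop: N = 21 categories.
CATEGORIES = list("ACDEFGHIKLMNPQRSTVWY") + ["*"]


def dope_score(obtained, wanted, weights=None, categories=CATEGORIES):
    """Score how closely an obtained distribution matches the wanted one.

    `obtained` and `wanted` map a category to a fraction in [0, 1];
    missing categories count as 0. `weights` maps a category to its
    weight factor (default 1). Python evaluates 0.0 ** 0.0 as 1.0,
    which matches the convention that 0^0 is defined as 1.
    """
    score = 1.0
    for c in categories:
        x = min(1.0, max(0.0, obtained.get(c, 0.0)))  # guard against rounding
        y = min(1.0, max(0.0, wanted.get(c, 0.0)))
        w = 1.0 if weights is None else weights.get(c, 1.0)
        numerator = (x ** y) * ((1.0 - x) ** (1.0 - y))
        denominator = (y ** y) * ((1.0 - y) ** (1.0 - y))
        score *= (numerator / denominator) ** w
    return score


# Wanted distribution taken from the example in the text.
wanted = {"A": 0.90, "I": 0.03, "V": 0.07}
print(dope_score(wanted, wanted))                # a perfect match
print(dope_score({"A": 0.5, "V": 0.5}, wanted))  # Ile is missing entirely
```

With this form the score reaches its maximum of 1 when the obtained and wanted distributions coincide, and drops to 0 as soon as a wanted residue cannot be obtained at all.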

A Monte Carlo algorithm (one example being the one
described by Valleau, J.P. & Whittington, S.G. (1977) A guide
to Monte Carlo for statistical mechanics: 1 Highways. In
"Statistical Mechanics, Part A" Equilibrium Techniques ed.
B.J. Berne, New York: Plenum) is used for finding the maximum
value of this function. In each iteration the following steps
are performed:

1. A new random nucleotide composition is chosen for each base,
where the absolute difference between the current and the new
composition is smaller than or equal to d for each of the
four nucleotides G,A,T,C in all three positions of the codon
(see below for definition of d).

2. The scores of the new composition and the current composition
are compared by the use of the function s as described above.
If the new score is higher than or equal to the score of the
current composition, the new composition is kept and the
current composition is changed to the new one. If the new
score is smaller, the probability of keeping the new
composition is exp(1000(new_score - current_score)).

A cycle normally consists of 1000 iterations as described
above in which d is decreasing linearly from 1 to 0. One
hundred or more cycles are performed in an optimization
process. The nucleotide composition resulting in the highest
score is finally presented.
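
The iteration scheme described above can be sketched as follows. This is a simplified illustration, not the DOPE program itself: the function names are hypothetical, the renormalisation of each perturbed position so that its four fractions sum to 1 is an assumption of this sketch (it can move a fraction by slightly more than d), and the toy score stands in for the scoring function s.

```python
import math
import random

BASES = "GATC"


def perturb(mix, d, rng):
    """Move every nucleotide fraction by at most d, then renormalise.

    The renormalisation step is an assumption of this sketch.
    """
    new_mix = []
    for position in mix:
        raw = {b: min(1.0, max(0.0, position[b] + rng.uniform(-d, d)))
               for b in BASES}
        total = sum(raw.values())
        if total == 0.0:
            raw, total = dict(position), 1.0
        new_mix.append({b: raw[b] / total for b in BASES})
    return new_mix


def optimise(score_fn, start, iterations=1000, cycles=100, seed=0):
    """Monte Carlo search for the codon mix with the highest score."""
    rng = random.Random(seed)
    current, current_score = start, score_fn(start)
    best, best_score = current, current_score
    for _ in range(cycles):
        for i in range(iterations):
            d = 1.0 - i / iterations  # decreases linearly from 1 towards 0
            candidate = perturb(current, d, rng)
            candidate_score = score_fn(candidate)
            delta = candidate_score - current_score
            # Keep improvements; keep a worse mix with probability
            # exp(1000 * (new_score - current_score)).
            if delta >= 0 or rng.random() < math.exp(1000.0 * delta):
                current, current_score = candidate, candidate_score
                if current_score > best_score:
                    best, best_score = current, current_score
    return best, best_score


def toy_score(mix):
    """Hypothetical stand-in score: rewards G in the first codon position."""
    return mix[0]["G"]


start = [{b: 0.25 for b in BASES} for _ in range(3)]
best, best_score = optimise(toy_score, start, iterations=200, cycles=2, seed=1)
print(f"best toy score: {best_score:.3f}")
```

The factor of 1000 in the exponent makes acceptance of a worse composition very unlikely unless the score difference is tiny, so the search behaves almost like a greedy hill climb with occasional small backward steps.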


EXAMPLES 
EXAMPLE 1 

Method of extracting important regions and amino acid residues
for identifying SP722 alpha-amylase variants with altered
properties.

The tertiary structure of the SP722 alpha-amylase was
displayed on a UNIX computer station running Insight and all
hydrogens and water molecules were deleted. For this minimized
amylase structure, all the symmetry related amylase molecules
were displayed, according to the crystal parameters and the
spacegroup determined in the pdb-structure file (Appendix 1). A
small computer program was made to calculate the distance from
each atom in the mother alpha-amylase to any atom in the
symmetry related amylase structures. Setting the distance
cut-off to 3.5 Å and 6.0 Å, respectively, resulted in the lists
below:
The amino acid residues being less than 6.0 Å from the
nearest neighboring amylase molecule are the following
positions identified in the model structure of the SP722
amylase:

ASN 19, ASP 20, GLY 21, GLN 22, ASN 25, ARG 28, ASP 29, GLN 53,
ARG 76, GLN 84, SER 87, HIS 90, LYS 93, ASN 94, ASN 125, ASN
126, ASN 128, LYS 142, ASP 144, LYS 156, TRP 157, ARG 158, TRP
159, TYR 160, HIS 161, PHE 173, GLN 174, ASN 175, ASP 183, GLY
184, LYS 185, ALA 186, TRP 187, ASP 188, TRP 189, GLU 190, SER
193, GLY 196, ASN 197, ASP 209, GLU 212, ASN 226, ASN 229, ALA
256, THR 257, GLY 258, LYS 259, ASN 280, LYS 281, TYR 298, ASN
299, ALA 300, ASN 302, SER 303, GLY 304, GLY 305, ASN 306, ALA
310, ASN 314, GLN 319, LYS 320, HIS 321, PRO 322, GLU 341, GLN
345, PHE 405, ASP 406, HIS 408, GLN 444, ALA 447, GLY 448, GLN
449, THR 463, ILE 464, ASN 465, ALA 466, ASP 467.

The amino acid residues being less than 3.5 Å from the
nearest neighboring amylase molecule are the following
positions identified in the model structure of the SP722
amylase:

GLN 22, ASN 25, ARG 28, ARG 76, ASN 94, ASN 125, ASN 128, ARG
158, TYR 160, PHE 173, GLN 174, GLY 184, TRP 189, ASP 209, ASN
226, ASN 229, TYR 298, ASN 299, ASN 302, ASN 306, ASN 314, LYS
320, GLN 345, PHE 405, ALA 447, ALA 466.
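
The "small computer program" mentioned in this example is not reproduced in the specification. A minimal sketch of such a distance filter is given below; the function name, the data layout and the coordinates are hypothetical, and a real run would read the atoms of the mother molecule and of the symmetry related molecules from the pdb files.

```python
import math


def contact_residues(mother_atoms, neighbour_atoms, cutoff):
    """Return residues of the mother molecule with any atom closer than
    `cutoff` (in angstrom) to any atom of the symmetry related molecules.

    mother_atoms: list of (residue_name, residue_number, (x, y, z))
    neighbour_atoms: list of (x, y, z)
    """
    close = set()
    for name, number, xyz in mother_atoms:
        if (name, number) in close:
            continue  # residue already flagged by another of its atoms
        for other in neighbour_atoms:
            if math.dist(xyz, other) < cutoff:
                close.add((name, number))
                break
    return sorted(close, key=lambda residue: residue[1])


# Hypothetical coordinates, one atom per residue for brevity.
mother = [
    ("ARG", 28, (0.0, 0.0, 0.0)),
    ("ASN", 125, (10.0, 0.0, 0.0)),
    ("GLY", 184, (30.0, 0.0, 0.0)),
]
neighbours = [(3.0, 0.0, 0.0), (15.0, 0.0, 0.0)]

for cutoff in (3.5, 6.0):
    print(cutoff, contact_residues(mother, neighbours, cutoff))
```

The all-against-all loop is quadratic in the number of atoms; for a single protein and its symmetry mates this is usually acceptable, and a spatial grid could be substituted if it is not.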

EXAMPLE 3 

Alternative method of extracting important regions and amino
acid residues for identifying SP722 alpha-amylase variants with
altered properties.

The tertiary structure of the SP722 alpha-amylase was
displayed on a UNIX computer station running Insight and all
hydrogen and water molecules were deleted. This minimized
amylase structure was imported ("getmol") into the WHAT IF
software and all the symmetry related amylase molecules within
6 angstrom were added to the soup ("symtry, symspg, sympar 6,
soushl, soup"). A new coordinate file for SP722 including the
symmetry molecules was written ("makmol, tot 0"). The center
SP722 amylase was then deleted from the structure file, and a
new file including the surrounding molecules was written. A
small computer program was made to calculate the distance from
each atom in the mother alpha-amylase (original SP722 structure
file without water molecules and hydrogen atoms) to any atom in
the symmetry related amylase file. Setting the distance cut-off
to 3.5 Å and 6.0 Å, respectively, resulted in identification of
the positions referred to in the description.



WHAT IF commands:

>>whatif<<                   Starts WHAT IF
>>getmol sp722_HOH.pdb<<     Imports the water- and H-depleted SP722
                             structure file into the WHAT IF software
>>symtry<<                   Go to symmetry menu
>>symspg<<                   Reads symmetry related information in
                             structure file
>>sympar 6<<                 Makes symmetry related residues within 6 Å
                             distance
>>soushl<<                   Adds the symmetry molecules to the soup
>>soup<<                     Go to soup menu
>>makmol, "sp722_wi6.pdb", tot 0<<
                             Writes the "sp722_wi6" pdb file including
                             all atoms in the soup (SP722 and the
                             symmetry molecules within 6 Å)



EXAMPLE 4 

Homology building of AA560 from SP722 tertiary structure

The overall homology of the AA560 alpha-amylase (SEQ ID NO:
12) to SP722 (SEQ ID NO: 4) is about 87% as described above.
The tertiary structure of the AA560 alpha-amylase was model
built on the structure disclosed in Appendix 1 herein.

EXAMPLE 5 

Method of extracting important regions for identifying AA560
alpha-amylase variants with altered properties

The model built structure of AA560 was subjected to the
same calculations as was SP722 in Example 1. Because the
homology of the two alpha-amylases is as high as about 87%
identity, the crystal interactions of AA560 are expected to be
similar to those of SP722, i.e., AA560 is expected to
crystallize in the same spacegroup as SP722 and with similar
unit cell parameters.

The tertiary structure of the AA560 alpha-amylase was
displayed on a UNIX computer station running Insight and all
hydrogens and water molecules were deleted. For this minimized
amylase structure, all the symmetry related amylase molecules
were displayed, according to the crystal parameters and the
spacegroup for SP722 (Appendix 1). A small computer program was
made to calculate the distance from each atom in the "mother"
alpha-amylase to any atom in the symmetry related amylase
structures. Setting the distance cut-off to 3.5 Å and 6.0 Å,
respectively, resulted in the lists below:

The amino acid residues being less than 6.0 Å from the
nearest neighboring amylase molecule are the following
positions identified in the model structure of the AA560
amylase: ASN 19, ASP 20, ASN 22, ASN 25, ARG 28, SER 29, GLN
53, ARG 76, GLN 84, ALA 87, ASN 90, LYS 93, SER 94, PRO 124,
ASN 125, ASN 126, ASN 128, LYS 142, ASP 144, LYS 156, TRP 157,
ARG 158, TRP 159, TYR 160, HIS 161, SER 170, ARG 171, LYS 172,
LEU 173, ASN 174, ASN 175, ASP 183, GLY 184, LYS 185, GLY 186,
TRP 187, ASP 188, TRP 189, GLU 190, THR 193, ASN 195, GLY 196,
ASN 197, ASP 209, GLU 212, ASN 226, GLY 229, ALA 256, THR 257,
GLY 258, LYS 259, LYS 281, TYR 298, ASN 299, ALA 300, LYS 302,
SER 303, GLY 304, ASN 306, ARG 310, GLN 311, ASN 314, GLN 319,
ARG 320, HIS 321, PRO 322, GLU 341, GLU 345, LEU 405, HIS 408,
ARG 444, ALA 447, GLY 448, GLN 449, THR 463, ILE 464, ASN 465,
ALA 466, ASP 467.

The amino acid residues being less than 3.5 Å from the nearest
neighboring amylase molecule are the following positions
identified in the model structure of the AA560 amylase: ARG 28,
ASN 125, ASN 128, ARG 158, TYR 160, ARG 171, LEU 173, ASN 174,
GLY 184, TRP 189, GLY 196, ASP 209, ASN 226, TYR 298, ASN 299,
LYS 302, ASN 306, ARG 310, ASN 314, ARG 320, GLU 345, ALA 447,
ALA 466.

EXAMPLE 6 

Construction, by localized random, doped mutagenesis, of AA560
alpha-amylase variants having increased solubility in comparison
to the parent enzyme

To increase the solubility of the AA560 alpha-amylase,
random mutagenesis in a pre-selected region was performed as
described in the following.

Region:   Residue:
SAI:      R181-W189

The DOPE software (see "Materials and Methods") was used to
determine spiked codons for each suggested change in the SAI
region minimizing the amount of stop codons (see table 1). The
exact distribution of nucleotides was calculated in the three
positions of the codon to give the suggested population of
amino acid changes. The doped regions were doped specifically
in the indicated positions to have a high chance of getting the
desired residues, but still allow other possibilities.

Table 1: 

Distribution of amino acid residues for each position 
R181: 72% R, 2% N, 7% Q, 4% H, 4% K, 11% S
G182: 73% G, 13% A, 12% S, 2% T
K185: 95% K, 5% R
G186: 50% A, 4% N, 6% D, 1% E, 1% G, 1% K, 5% S, 31% T
W187: 100% W
D188: 100% D
W189: 92% W, 8% S

The resulting doped oligonucleotide strand is shown in
Table 2 as the sense strand, together with the wild-type nucleotide
and amino acid sequences and the distribution of nucleotides for
each doped position.

Table 2:

Position         181 182 185 186 187 188 189
Amino acid seq.  Arg Gly Lys Gly Trp Asp Trp
Wt nuc. seq.     aga ggt aaa ggg tgg gat tgg

Forward primer (SEQ ID NO: 14):
FSA: 5'-caa aat cgt atc tac aaa ttc 123 456 a7g 8910 tgg
gat t11g gaa gta gat tcg gaa aat-3'

Distribution of nucleotides for each doped position
1: 35% A, 65% C
2: 83% G, 17% A
3: 63% G, 37% T
4: 86% G, 14% A
5: 85% G, 15% C
6: 50% T, 50% C
7: 95% A, 5% G
8: 58% G, 37% A, 5% T
9: 86% C, 13% A, 1% G
10: 83% T, 17% G
11: 92% G, 8% C
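
The relation between the nucleotide doping and the resulting amino acid population can be checked with a short calculation. The sketch below is not the DOPE software; it simply multiplies the per-position nucleotide frequencies of doped positions 1, 2 and 3 (the codon for R181 in the forward primer) and translates each codon with the standard genetic code. The function name `amino_acid_distribution` is an assumption made for this example.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code with codons enumerated in TCAG order; "*" is a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def amino_acid_distribution(pos1, pos2, pos3):
    """Combine independent nucleotide frequencies at the three codon
    positions into an amino acid frequency distribution."""
    dist = {}
    for (b1, p1), (b2, p2), (b3, p3) in product(pos1.items(), pos2.items(), pos3.items()):
        aa = CODON_TABLE[b1 + b2 + b3]
        dist[aa] = dist.get(aa, 0.0) + p1 * p2 * p3
    return dist

# Doped positions 1-3 (codon 181), frequencies as listed under Table 2.
r181 = amino_acid_distribution(
    {"A": 0.35, "C": 0.65},
    {"G": 0.83, "A": 0.17},
    {"G": 0.63, "T": 0.37},
)
r181_percent = {aa: round(100 * p) for aa, p in r181.items()}
print(r181_percent)
```

After rounding, the result corresponds to the R181 row of Table 1 (72% R, 2% N, 7% Q, 4% H, 4% K, 11% S), and no stop codon can arise from this doping. The same function applied to positions 4-6 gives the G182 row.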



Reverse primer (SEQ ID NO: 15) : 
RSA: 5'-gaa ttt gta gat acg att ttg-3'

Random mutagenesis 

The spiked oligonucleotides apparent from Table 2 (which by
a common term is designated FSA) and reverse primers RSA for
the SAI region and specific SEQ ID NO: 12 (AA560) primers
covering the SacII and the DraIII sites are used to generate
PCR library fragments by the overlap extension method (Horton
et al., Gene, 77 (1989), pp. 61-68) with an overlap of 21 base
pairs. Plasmid pJE1 is the template for the Polymerase Chain
Reaction. The PCR fragments are cloned in the E. coli/Bacillus
shuttle vector pDork101 (see "Materials and Methods" section),
enabling mutagenesis in E. coli and immediate expression in
Bacillus subtilis, preventing lethal accumulation of amylases in
E. coli. After establishing the cloned PCR fragments in E.
coli, a modified pUC19 fragment is digested out of the plasmid,
and the promoter and the mutated Termamyl gene are physically
connected so that expression can take place in Bacillus.
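
The 21 base pair overlap used for overlap extension can be verified directly from the two primers: the reverse primer RSA should be the reverse complement of the undoped 5' end of FSA. The sketch below assumes the first seven codons of FSA read "caa aat cgt atc tac aaa ttc"; the helper name `reverse_complement` is introduced only for this illustration.

```python
def reverse_complement(seq):
    """Return the reverse complement of a lowercase DNA sequence."""
    complement = {"a": "t", "t": "a", "c": "g", "g": "c"}
    return "".join(complement[base] for base in reversed(seq))

# Undoped 5' end of the forward primer FSA (first seven codons).
fsa_5prime = "caa aat cgt atc tac aaa ttc".replace(" ", "")
# Reverse primer RSA.
rsa = "gaa ttt gta gat acg att ttg".replace(" ", "")

is_complementary = reverse_complement(fsa_5prime) == rsa
overlap_length = len(rsa)
print(is_complementary, overlap_length)
```

The two primers anneal to opposite strands over the same 21 nucleotides, which is what allows the two PCR fragments to be joined in the overlap extension step.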

Screening 

The library may be screened in the low calcium filter 
assays described in the "Material and Methods" section above.
EXAMPLE 7

Construction of variants of AA560 SEQ ID NO: 12

The gene encoding the AA560 alpha-amylase shown in SEQ ID
NO: 12 is located in a plasmid pTVB223. The amylase is
expressed from the amyL promoter in this construct in Bacillus
subtilis.

A variant of the invention with delta(D183-G184) mutations
was constructed by the mega-primer method as described by
Sarkar and Sommer (1990), BioTechniques 8: 404-407.

Gene-specific primer B1 (SEQ ID NO: 16) and mutagenic
primer 101458 (SEQ ID NO: 18) were used to amplify by PCR an
approximately 645 bp DNA fragment from a pTVB223 plasmid
encoding AA560 (shown in SEQ ID NO: 12).

The 645 bp fragment was purified from an agarose gel and
used as a mega-primer together with primer Y2 (SEQ ID NO: 17)
in a second PCR carried out on the same template.

The resulting approximately 1080 bp fragment was digested
with restriction enzymes BstEII and AflIII, and the resulting
approximately 510 bp DNA fragment was purified and ligated with
the pTVB223 plasmid digested with the same enzymes. Competent
Bacillus subtilis SHA273 (amylase and protease low) cells were
transformed with the ligation mixture, and chloramphenicol-resistant
transformants were checked by DNA sequencing to verify the
presence of the correct mutations on the plasmid.
primer B1: (SEQ ID NO: 16)
5' GGA TTG GTG AGG GTG TTA TTT GCG 3'
primer Y2: (SEQ ID NO: 17)
5' CTT GTT GGG TTG TGA GAA CCA ATG 3'
primer 101458: (SEQ ID NO: 18)
5' GT CAT AGT TGC GGA AAT CTG TAT CGA CTT C 3'

The resulting plasmid encoding the AA560 amylase with
delta(D183-G184)+N195F was named pTVB232.

The construction of the other variants of the invention was 
carried out in a similar manner. 
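
The variant nomenclature used in these examples (deletions such as delta(D183-G184) combined with substitutions such as N195F, all numbered by the parent sequence) can be illustrated with a small sketch. The parent sequence below is a hypothetical six-residue toy string, not AA560, and the function `apply_mutations` is an assumption introduced only for this example.

```python
import re

def apply_mutations(parent, deleted_positions, substitutions):
    """Apply substitutions (e.g. "N6F") and deletions to a parent sequence,
    with all positions given in 1-based parent numbering."""
    # Index residues by their position in the parent sequence.
    residues = {i + 1: aa for i, aa in enumerate(parent)}
    for sub in substitutions:
        match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", sub)
        if match is None:
            raise ValueError(f"unrecognised substitution: {sub}")
        old, pos, new = match.group(1), int(match.group(2)), match.group(3)
        if residues.get(pos) != old:
            raise ValueError(f"parent does not have {old} at position {pos}")
        residues[pos] = new
    for pos in deleted_positions:
        residues.pop(pos)
    return "".join(residues[pos] for pos in sorted(residues))

# Hypothetical toy parent: M1 K2 D3 G4 A5 N6.
toy_parent = "MKDGAN"
# Analogue of delta(D183-G184)+N195F on the toy sequence: delta(D3-G4)+N6F.
toy_variant = apply_mutations(toy_parent, [3, 4], ["N6F"])
print(toy_variant)
```

Keeping the numbering tied to the parent means that a substitution downstream of a deletion is still named by its original position, as in delta(D183-G184)+N195F.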


Example 8 

Determination of solubility of AA560 and SP722 variants 



The AA560 variant was constructed as described above. The
solubility was determined as described in the "Material and
Methods" section under Solubility Determination I.

Variant: Solubility (mg/ml)
Wild-type AA560: 2 mg/ml
AA560 delta(D183-G184)+N195F+N445Q+K446N (AX23): more than 6 mg/ml

Conditions: 0.02 M Tris-HCl, 0.15 M NaCl, pH 7.5, room temperature



The concentration step was started with a protein amount of
40 mg and, in general, 50% of the amylases were lost during the
process. This loss of protein occurred for all enzymes in the
initial phase of the process, and full recovery of the enzymes
was obtained in the terminating phase, where the volumes
become very limited. Loss of enzymes was due to adsorption to
the membrane.

Example 9

Determination of solubility of AA560 variants

AA560 variants were constructed as described above, using the
wild-type AA560 as template. The solubility was determined as
described in the "Material and Methods" section under
Solubility Determination II.



Mutations: Solubility (g/L)
Wild type AA560: 2.8
AA560+N125A: 9
AA560+N125R: 9
AA560+N306R: 7
AA560+Y298H+N299A+K302R+S303Q+N306G: 12
AA560+delta(D183+G184), R181Q, E345R: 12
AA560+delta(D183+G184), R181Q, W189R: 4.5
AA560+delta(D183+G184), R181Q, S94K, W189R: 7
AA560+delta(D183+G184), N195F, R118K+R320K+R458K: 10






AA560+delta(D183+G184), N195F, W189K, N306K, N445K: 10
AA560+delta(D183+G184), N195F, R118K, N125K, R444K, N445K: 3
AA560+delta(D183+G184), N195F, W189K, N445K: 12
AA560+delta(D183+G184), N195F, R400T: 4
AA560+delta(D183+G184), N195F, W439R: 5.5
AA560+delta(D183+G184), N195F, Q449E: 3.4
AA560+delta(D183+G184), N195F, N484Q: 4.3


AA560+delta(D183+G184), N195F, R181Q, N445Q, K446N: 4.5


AA560+delta(D183+G184), N195F, R181E, N445Q, K446N: 3.5
AA560+delta(D183+G184), N195F, R181E, K446N: 5.2
AA560+delta(D183+G184), N195F, R310A, R181Q, N445Q, K446N: 5
AA560+delta(D183+G184), N195F, R320K, R181Q, N445Q, K446N: 6.9
AA560+delta(D183+G184), N195F, Q319K, R320D, R181Q, N445Q, K446N: 4
AA560+delta(D183+G184), N195F, N306A, R181Q, N445Q, K446N: 5.5
AA560+delta(D183+G184), N195F, K302N, R181Q, N445Q, K446N: 8
AA560+delta(D183+G184), N195F, E345N, R181Q, N445Q, K446N: 3.5
AA560+delta(D183+G184), N195F, Y298F, R181Q, N445Q, K446N: 5
AA560+delta(D183+G184), N195F, R28N, R181Q, N445Q, K446N: 4.5
AA560+delta(D183+G184), N195F, R28N, R310A, R181Q, N445Q, K446N: 11
AA560+delta(D183+G184), N195F, N128D, N306D, R181Q, N445Q, K446N
CLAIMS

1. A variant of a parent Termamyl-like alpha-amylase,
comprising an alteration at one or more positions selected from
the group of:

R28, R118, N174, R181, G182, D183, G184, G186, W189, N195, M202,
Y298, N299, K302, S303, N306, R310, N314, R320, H324, E345,
Y396, R400, W439, R444, N445, K446, Q449, R458, N471, N484,
wherein

(a) the alteration(s) are independently
(i) an insertion of an amino acid downstream of the amino
acid which occupies the position,
(ii) a deletion of the amino acid which occupies the
position, or
(iii) a substitution of the amino acid which occupies the
position with a different amino acid,

(b) the variant has alpha-amylase activity and (c) each position
corresponds to a position of the amino acid sequence of the
parent Termamyl-like alpha-amylase having the amino acid
sequence of AA560 shown in SEQ ID NO: 12.

2. The variant of claim 1, which variant has one or more of
the following mutations:

Delta G184; Delta (R181-G182); Delta (D183-G184);
R28N,K; S94K; R118K; N125A,R,K; N174D; R181Q,E,K; G186R;
W189R,K; N195F; M202L; Y298H,F; N299A; K302R; S303Q;
N306G,D,R,K; R310A,K,Q,E,H,D,N; N314D; R320K; H324K;
E345R,D,K,N; Y396F; R400T,K; W439R; R444K; N445K,Q; K446N;
Q449E; R458K; N471E; N484Q.

3. The variant of claim 1-2, wherein the variant further
comprises a mutation in M202, in particular M202L,T.

4. The variant of claim 1-3, wherein the variant further
comprises a mutation in N195, in particular N195F.

5. The variant of claim 1-4, wherein the variant further
comprises a mutation in G186, in particular G186R.

6. The variant of claim 1-5, wherein the variant further
comprises a mutation in R181, in particular R181Q.

7. The variant of any of claims 1-6, wherein the parent
Termamyl-like alpha-amylase is derived from a strain of B.
licheniformis, B. amyloliquefaciens, B. stearothermophilus,
Bacillus sp. NCIB 12289, NCIB 12512, NCIB 12513 or DSM 9375, or
DSMZ no. 12649, KSM AP1378, or KSM K36 or KSM K38.

8. The variant of claims 1-7, wherein the parent Termamyl-like
alpha-amylase is any of the alpha-amylases selected from the
group depicted in SEQ ID NOS: 2, 4, 6, 8, 10, 12, and 13.

9. The variant according to any of claims 1-8, wherein the
parent Termamyl-like alpha-amylase has an amino acid sequence
which has a degree of identity to SEQ ID NO: 4 of at least 60%,
preferably 70%, more preferably at least 80%, even more
preferably at least about 90%, even more preferably at least
95%, even more preferably at least 97%, and even more preferably
at least 99%.

10. The variant of any of claims 1-9, wherein the parent
Termamyl-like alpha-amylase is encoded by a nucleic acid
sequence which hybridizes under low, preferably medium,
more preferably high stringency conditions, with the nucleic acid
sequence of SEQ ID NO: 11.

11. The variant of claims 1-10, which variant has altered
solubility, preferably increased solubility, in particular under
washing, dish washing or hard surface cleaning conditions.

12. A DNA construct comprising a DNA sequence encoding an
alpha-amylase variant according to any one of claims 1 to 11.

13. A recombinant expression vector which carries a DNA
construct according to claim 12.

14. A cell which is transformed with a DNA construct according
to claim 9 or a vector according to claim 10.

15. A cell according to claim 14, which is a microorganism,
preferably a bacterium or a fungus, in particular a
gram-positive bacterium, such as Bacillus subtilis, Bacillus
licheniformis, Bacillus lentus, Bacillus brevis, Bacillus
stearothermophilus, Bacillus alkalophilus, Bacillus
amyloliquefaciens, Bacillus coagulans, Bacillus circulans,
Bacillus lautus or Bacillus thuringiensis.

16. A composition comprising an alpha-amylase variant of claims
1-11.

17. A detergent additive comprising an alpha-amylase variant
according to any one of claims 1 to 11, optionally in the form of
a non-dusting granulate, stabilised liquid or protected enzyme.

18. A detergent additive according to claim 17, which contains
0.02-200 mg of enzyme protein/g of the additive.

19. A detergent additive according to claims 17 or 18, which
additionally comprises another enzyme such as a protease, a
lipase, a peroxidase, mannanase, maltogenic amylase, CGTase,
amylase or another amylolytic enzyme, such as glucoamylase,
and/or a cellulase.

20. A detergent composition comprising an alpha-amylase variant
according to any of claims 1 to 11.

21. A detergent composition according to claim 20, which
additionally comprises another enzyme such as a protease, a lipase,
a peroxidase, mannanase, maltogenic amylase, CGTase, another
amylolytic enzyme and/or a cellulase.

22. A manual or automatic dishwashing detergent composition
comprising an alpha-amylase variant according to any of claims 1
to 11.

23. A dishwashing detergent composition according to claim 22,
which additionally comprises another enzyme such as a protease,
a lipase, a peroxidase, mannanase, maltogenic amylase, CGTase,
amylase or another amylolytic enzyme, such as glucoamylase,
and/or a cellulase.

24. A manual or automatic laundry washing composition comprising
an alpha-amylase variant according to any of claims 1 to 11.

25. A laundry washing composition according to claim 24, which
additionally comprises another enzyme such as a protease, a
lipase, a peroxidase, mannanase, maltogenic amylase, CGTase, an
amylase and/or another amylolytic enzyme, such as
glucoamylase, and/or a cellulase.

26. Use of an alpha-amylase variant according to any one of 
claims 1 to 11 or a composition according to claims 16 to 25 for 
washing and/or dishwashing. 

27. Use of an alpha-amylase variant according to any one of
claims 1 to 8 or a composition according to claims 16 to 25 for
textile desizing.

28. Use of an alpha-amylase variant according to any of claims 1 
to 11 or a composition according to claims 16 to 25 for starch 
liquefaction. 

29. A method of producing a variant of a parent alpha-amylase
having an altered property relative to the parent, wherein the
parent alpha-amylase has the sequence of SEQ ID NOS: 2, 4, 6,
8, 10, 12 or 13, or has a sequence having at least 60% identity to
said sequence, said method comprising

(a) modelling the parent alpha-amylase on the three-
dimensional structure of SEQ ID NO: 4 depicted in the Appendix 1
to produce a three-dimensional structure of the parent
alpha-amylase;

(b) identifying in the three-dimensional structure obtained
in step (a) at least one structural part of the parent wherein
an alteration in said structural part is predicted to result in
said altered property;

(c) modifying the sequence of a nucleic acid encoding the
parent alpha-amylase to produce a nucleic acid encoding a
deletion, insertion, or substitution of one or more amino acids
at a position corresponding to said structural part; and

(d) expressing the modified nucleic acid in a host cell
to produce the variant alpha-amylase,

wherein the variant has alpha-amylase enzymatic activity and
has at least one altered property relative to the parent.

30. The method of claim 29, wherein the altered property is
selected from the group consisting of substrate specificity,
substrate binding, substrate cleavage pattern, temperature
stability, pH dependence of enzymatic activity, pH dependence
of stability, stability towards oxidation, Ca2+ dependency,
specific activity or altered solubility, preferably increased
solubility.

31. A method of constructing a variant of a parent
alpha-amylase having an altered property relative to the parent,
wherein the parent alpha-amylase has the sequence of SEQ ID
NOS: 2, 4, 6, 8, 10, 12 or 13 or has a sequence having at least 60%
identity to said sequence, said method comprising

(a) modelling the parent alpha-amylase on the three-
dimensional structure of SEQ ID NO: 4 depicted in the Appendix
1 to produce a three-dimensional structure of the parent
alpha-amylase;

(b) comparing the three-dimensional structure obtained in
step (a) with a three-dimensional structure of an unrelated
alpha-amylase, wherein the unrelated alpha-amylase differs from
the parent alpha-amylase in said property;

(c) identifying a structural part of the three-dimensional
structure obtained in step (a) which is different from the
three-dimensional structure of the unrelated alpha-amylase and
which is predicted to be relevant to said property;

(d) modifying the sequence of a nucleic acid encoding the
parent alpha-amylase to produce a nucleic acid encoding a
deletion, insertion, or substitution of one or more amino acids
at a position corresponding to said structural part; and

(e) expressing the modified nucleic acid in a host cell to
produce the variant alpha-amylase,

wherein the variant has alpha-amylase activity and has one or
more altered properties as compared to the parent
alpha-amylase.

32. The method of claim 31, wherein, in step (d), the
modification is a deletion, substitution, or insertion of one
or more amino acid residues in the parent alpha-amylase so that
the structural part in the variant corresponds to the
structural part in the unrelated alpha-amylase.

33. A method of increasing the solubility of polypeptide
crystals, in particular enzyme crystals, wherein one or more
amino acid residue(s),

1) located within a distance of 6.0 Å of a neighbouring enzyme
molecule in the tertiary crystal structure, and

2) interacting with said neighbouring enzyme molecule,
are mutated.

34. The method of claim 33, wherein the amino acid residues to
be mutated are located within 3.5 Å of a neighbouring enzyme.

35. The method of claim 33 or 34, wherein the enzyme has
protease, amylase, CGTase, mannanase, maltogenic amylase,
glucoamylase, carbohydrase, transferase, lyase, oxidoreductase
or lipase activity.

36. The method of claim 32, wherein the amylase is an
alpha-amylase, such as a Termamyl-like amylase.

37. The method of claims 33-36, wherein the polypeptide is a
protein having biological activity.

38. The method of claims 33-37, wherein the polypeptide has
antimicrobial activity.

39. The method of claim 3-38, wherein the polypeptide is
selected from the group comprising: insulin, growth hormone,
EPO, TPO, Factor VII, Factor VIII.

40. A method of increasing the solubility of Termamyl-like
alpha-amylase crystals, wherein one or more amino acid
residue(s),

1) located within a distance of 6.0 Å of a neighbouring
Termamyl-like alpha-amylase molecule in the tertiary crystal
structure, and

2) interacting with said neighbouring Termamyl-like
alpha-amylase molecule,

are mutated.

41. The method of claim 40, wherein the amino acid residues to
be mutated are located within 3.5 Å of a neighbouring
Termamyl-like alpha-amylase.

42. The method of claims 40 or 41, wherein the crystal
structure of SP722 depicted in APPENDIX 1 is used as a
reference for determining the distance between neighbouring
alpha-amylases.

43. The method of claims 40-42, wherein a structure modelled
in accordance with any of claims 26-29 from the crystal structure
of SP722 depicted in APPENDIX 1 is used as a reference for
determining the distance between neighbouring alpha-amylases.

44. The method of claims 40-43, wherein the Termamyl-like
alpha-amylase is selected from the group of alpha-amylases
derived from a strain of B. licheniformis, B.
amyloliquefaciens, B. stearothermophilus, Bacillus sp. NCIB 12289,
NCIB 12512, NCIB 12513 or DSM 9375, or DSMZ no. 12649, KSM
AP1378, K38, K36.

45. The method of claim 44, wherein the Termamyl-like
alpha-amylase is any of the alpha-amylases selected from the group
depicted in SEQ ID NOS: 2, 4, 6, 8, 10, 12, and 13.

46. The method of any of claims 40-45, wherein the Termamyl-like
alpha-amylase has an amino acid sequence which has a degree of
identity to SEQ ID NO: 4 of at least 60%, preferably 70%, more
preferably at least 80%, even more preferably at least about
90%, even more preferably at least 95%, even more preferably at
least 97%, and even more preferably at least 99%.

47. The method of any of claims 40-46, wherein the Termamyl-like
alpha-amylase is encoded by a nucleic acid sequence which
hybridizes under low, preferably medium, more preferably high
stringency conditions, with the nucleic acid sequence of SEQ ID
NO: 11.

48. The method of claims 40-47, wherein the Termamyl-like
alpha-amylase amino acid residues being less than 6.0 Å from
the nearest neighbouring amylase molecule are the following
(using the SP722 numbering):

19, 20, 21, 22, 25, 28, 29, 53, 76, 84, 87, 90, 93, 94, 124,
125, 126, 128, 142, 144, 156, 157, 158, 159, 160, 161, 170,
171, 172, 173, 174, 175, 183, 184, 185, 186, 187, 188, 189,
190, 193, 195, 196, 197, 209, 212, 226, 229, 256, 257, 258,
259, 280, 281, 298, 299, 300, 302, 303, 304, 305, 306, 310,
311, 314, 319, 320, 321, 322, 341, 345, 405, 406, 408, 444,
447, 448, 449, 463, 464, 465, 466, 467.

49. The method of claims 40-47, wherein the Termamyl-like
alpha-amylase amino acid residues being less than 3.5 Å from
the nearest neighbouring amylase molecule are the following
(using the SP722 numbering):

22, 25, 28, 76, 94, 125, 128, 158, 160, 171, 173, 174, 184,
189, 209, 226, 229, 298, 299, 302, 306, 310, 314, 320, 345,
405, 447, 466.

Title: Variants with altered properties

ABSTRACT

The present invention relates to variants (mutants) of
polypeptides, in particular Termamyl-like alpha-amylases, which
variant has alpha-amylase activity and exhibits an alteration in
at least one of the following properties relative to said parent
alpha-amylase: substrate specificity, substrate binding,
substrate cleavage pattern, thermal stability, pH/activity
profile, pH/stability profile, stability towards oxidation, Ca2+
dependency, specific activity, and solubility, in particular
under production conditions.

1 50
1 HHNGTNGTMM QYFEWHLPND GNHWNRLRDD ASNLRNRGIT AIWIPPAWKG
2 HHNGTNGTMM QYFEWYLPND GNHWNRLRDD AANLKSKGIT AVWIPPAWKG
3 ....VNGTLM QYFEWYTPND GQHWKRLQND AEHLSDIGIT AVWIPPAYKG
4 ..ANLNGTLM QYFEWYMPND GQHWRRLQND SAYLAEHGIT AVWIPPAYKG
5 .AAPFNGTMM QYFEWYLPDD GTLWTKVANE ANNLSSLGIT ALWLPPAYKG

51 100
1 TSQNDVGYGA YDLYDLGEFN QKGTVRTKYG TRSQLESAIH ALKNNGVQVY
2 TSQNDVGYGA YDLYDLGEFN QKGTVRTKYG TRNQLQAAVT SLKNNGIQVY
3 LSQSDNGYGP YDLYDLGEFQ QKGTVRTKYG TKSELQDAIG SLHSRNVQVY
4 TSQADVGYGA YDLYDLGEFH QKGTVRTKYG TKGELQSAIK SLHSRDINVY
5 TSRSDVGYGV YDLYDLGEFN QKGTVRTKYG TKAQYLQAIQ AAHAAGMQVY

101 150
1 GDVVMNHKGG ADATENVLAV EVNPNNRNQE ISGDYTIEAW TKFDFPGRGN
2 GDVVMNHKGG ADGTEIVNAV EVNRSNRNQE TSGEYAIEAW TKFDFPGRGN
3 GDVVLNHKAG ADATEDVTAV EVNPANRNQE TSEEYQIKAW TDFRFPGRGN
4 GDVVINHKGG ADATEDVTAV EVDPADRNRV ISGEHLIKAW THFHFPGRGS
5 ADVVFDHKGG ADGTEWVDAV EVNPSDRNQE ISGTYQIQAW TKFDFPGRGN

151 200
1 TYSDFKWRWY HFDGVDWDQS RQFQNRIYKF RGDGKAWDWE VDSENGNYDY
2 NHSSFKWRWY HFDGTDWDQS RQLQNKIYKF RGTGKAWDWE VDTENGNYDY
3 TYSDFKWHWY HFDGADWDES RKI.SRIFKF RGEGKAWDWE VSSENGNYDY
4 TYSDFKWHWY HFDGTDWDES RKL.NRIYKF ..QGKAWDWE VSNENGNYDY
5 TYSSFKWRWY HFDGVDWDES RKL.SRIYKF RGIGKAWDWE VDTENGNYDY

Fig. 1

201 250
1 LMYADVDMDH PEVVNELRRW GEWYTNTLNL DGFRIDAVKH IKYSFTRDWL
2 LMYADVDMDH PEVIHELRNW GVWYTNTLNL DGFRIDAVKH IKYSFTRDWL
3 LMYADVDYDH PDVVAETKKW GIWYANELSL DGFRIDAAKH IKFSFLRDWV
4 LMYADIDYDH PDVAAEIKRW GTWYANELQL DGFRLDAVKH IKFSFLRDWV
5 LMYADLDMDH PEVVTELKNW GKWYVNTTNI DGFRLDAVKH IKFSFFPDWL

251 300
1 THVRNATGKE MFAVAEFWKN DLGALENYLN KTNWNHSVFD VPLHYNLYNA
2 THVRNTTGKP MFAVAEFWKN DLGAIENYLN KTSWNHSVFD VPLHYNLYNA
3 QAVRQATGKE MFTVAEYWQN NAGKLENYLN KTSFNQSVFD VPLHFNLQAA
4 NHVREKTGKE MFTVAEYWQN DLGALENYLN KTNFNHSVFD VPLHYQFHAA
5 SYVRSQTGKP LFTVGEYWSY DINKLHNYIT KTDGTMSLFD APLHNKFYTA

301 350
1 SNSGGNYDMA KLLNGTVVQK HPMHAVTFVD NHDSQPGESL ESFVQEWFKP
2 SNSGGYYDMR NILNGSVVQK HPTHAVTFVD NHDSQPGEAL ESFVQQWFKP
3 SSQGGGYDMR RLLDGTVVSR HPEKAVTFVE NHDTQPGQSL ESTVQTWFKP
4 STQGGGYDMR KLLNGTVVSK HPLKSVTFVD NHDTQPGQSL ESTVQTWFKP
5 SKSGGAFDMR TLMTNTLMKD QPTLAVTFVD NHDTEPGQAL QSWVDPWFKP

351 400
1 LAYALILTRE QGYPSVFYGD YYGIPTHS.. .VPAMKAKID PILEARQNFA
2 LAYALVLTRE QGYPSVFYGD YYGIPTHG.. .VPAMKSKID PLLQARQTFA
3 LAYAFILTRE SGYPQVFYGD MYGTKGTSPK EIPSLKDNIE PILKARKEYA
4 LAYAFILTRE SGYPQVFYGD MYGTKGDSQR EIPALKHKIE PILKARKQYA
5 LAYAFILTRQ EGYPCVFYGD YYGIPQYN.. .IPSLKSKID PLLIARRDYA

401 450
1 YGTQHDYFDH HNIIGWTREG NTTHPNSGLA TIMSDGPGGE KWMYVGQNKA
2 YGTQHDYFDH HDIIGWTREG NSSHPNSGLA TIMSDGPGGN KWMYVGKNKA
3 YGPQHDYIDH PDVIGWTREG DSSAAKSGLA ALITDGPGGS KRMYAGLKNA
4 YGAQHDYFDH HDIVGWTREG DSSVANSGLA ALITDGPGGA KRMYVGRQNA
5 YGTQHDYLDH SDIIGWTREG GTEKPGSGLA ALITDGPGGS KWMYVGKQHA

Fig. 1 (continued)

451 500
1 GQVWHDITGN KPGTVTINAD GWANFSVNGG SVSIWVKR
2 GQVWRDITGN RTGTVTINAD GWGNFSVNGG SVSVWVKQ
3 GETWYDITGN RSDTVKIGSD GWGEFHVNDG SVSIYVQ
4 GETWHDITGN RSEPVVINSE GWGEFHVNGG SVSIYVQR
5 GKVFYDLTGN RSDTVTINSD GWGEFKVNGG SVSVWVPRKT TVSTIARPIT

501 519
1
2
3
4
5 TRPWTGEFVR WTEPRLVAW

Fig. 1 (continued)

SEQUENCE LISTING

<110> Novozymes A/S

<120>

<130>

<160> 18

<170> PatentIn Ver. 3.0

<210> 1
<211> 1455
<212> DNA
<213> Bacillus sp.

<220>
<221> CDS
<222> (1)..(1455)
<223> SP690



<400> 1

cat cat aat gga aca aat ggt act atg atg caa tat ttc gaa tgg tat 48
His His Asn Gly Thr Asn Gly Thr Met Met Gln Tyr Phe Glu Trp Tyr
1 5 10 15

ttg cca aat gac ggg aat cat tgg aac agg ttg agg gat gac gca gct 96
Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ala
20 25 30

aac tta aag agt aaa ggg ata aca gct gta tgg atc cca cct gca tgg 144
Asn Leu Lys Ser Lys Gly Ile Thr Ala Val Trp Ile Pro Pro Ala Trp
35 40 45

aag ggg act tcc cag aat gat gta ggt tat gga gcc tat gat tta tat 192
Lys Gly Thr Ser Gln Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr
50 55 60

gat ctt gga gag ttt aac cag aag ggg acg gtt cgt aca aaa tat gga 240
Asp Leu Gly Glu Phe Asn Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly
65 70 75 80

aca cgc aac cag cta cag gct gcg gtg acc tct tta aaa aat aac ggc 288
Thr Arg Asn Gln Leu Gln Ala Ala Val Thr Ser Leu Lys Asn Asn Gly
85 90 95

att cag gta tat ggt gat gtc gtc atg aat cat aaa ggt gga gca gat 336
Ile Gln Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp
100 105 110

ggt acg gaa att gta aat gcg gta gaa gtg aat cgg agc aac cga aac 384
Gly Thr Glu Ile Val Asn Ala Val Glu Val Asn Arg Ser Asn Arg Asn
115 120 125

cag gaa acc tca gga gag tat gca ata gaa gcg tgg aca aag ttt gat 432
Gln Glu Thr Ser Gly Glu Tyr Ala Ile Glu Ala Trp Thr Lys Phe Asp
130 135 140

ttt cct gga aga gga aat aac cat tcc agc ttt aag tgg cgc tgg tat 480
Phe Pro Gly Arg Gly Asn Asn His Ser Ser Phe Lys Trp Arg Trp Tyr
145 150 155 160

cat ttt gat ggg aca gat tgg gat cag tca cgc cag ctt caa aac aaa 528
His Phe Asp Gly Thr Asp Trp Asp Gln Ser Arg Gln Leu Gln Asn Lys
165 170 175

ata tat aaa ttc agg gga aca ggc aag gcc tgg gac tgg gaa gtc gat 576
Ile Tyr Lys Phe Arg Gly Thr Gly Lys Ala Trp Asp Trp Glu Val Asp
180 185 190

aca gag aat ggc aac tat gac tat ctt atg tat gca gac gtg gat atg 624
Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met
195 200 205

gat cac cca gaa gta ata cat gaa ctt aga aac tgg gga gtg tgg tat 672
Asp His Pro Glu Val Ile His Glu Leu Arg Asn Trp Gly Val Trp Tyr
210 215 220

acg aat aca ctg aac ctt gat gga ttt aga ata gat gca gtg aaa cat 720
Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg Ile Asp Ala Val Lys His
225 230 235 240

ata aaa tat agc ttt acg aga gat tgg ctt aca cat gtg cgt aac acc 768
Ile Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Thr
245 250 255

aca ggt aaa cca atg ttt gca gtg gct gag ttt tgg aaa aat gac ctt 816
Thr Gly Lys Pro Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu
260 265 270

ggt gca att gaa aac tat ttg aat aaa aca agt tgg aat cac tcg gtg 864
Gly Ala Ile Glu Asn Tyr Leu Asn Lys Thr Ser Trp Asn His Ser Val
275 280 285

ttt gat gtt cct ctc cac tat aat ttg tac aat gca tct aat agc ggt 912
Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly
290 295 300

ggt tat tat gat atg aga aat att tta aat ggt tct gtg gtg caa aaa 960
Gly Tyr Tyr Asp Met Arg Asn Ile Leu Asn Gly Ser Val Val Gln Lys
305 310 315 320

cat cca aca cat gcc gtt act ttt gtt gat aac cat gat tct cag ccc 1008
His Pro Thr His Ala Val Thr Phe Val Asp Asn His Asp Ser Gln Pro
325 330 335

ggg gaa gca ttg gaa tcc ttt gtt caa caa tgg ttt aaa cca ctt gca 1056
Gly Glu Ala Leu Glu Ser Phe Val Gln Gln Trp Phe Lys Pro Leu Ala
340 345 350

tat gca ttg gtt ctg aca agg gaa caa ggt tat cct tcc gta ttt tat 1104
Tyr Ala Leu Val Leu Thr Arg Glu Gln Gly Tyr Pro Ser Val Phe Tyr
355 360 365

ggg gat tac tac ggt atc cca acc cat ggt gtt ccg gct atg aaa tct 1152
Gly Asp Tyr Tyr Gly Ile Pro Thr His Gly Val Pro Ala Met Lys Ser
370 375 380

aaa ata gac cct ctt ctg cag gca cgt caa act ttt gcc tat ggt acg 1200
Lys Ile Asp Pro Leu Leu Gln Ala Arg Gln Thr Phe Ala Tyr Gly Thr
385 390 395 400

cag cat gat tac ttt gat cat cat gat att atc ggt tgg aca aga gag 1248
Gln His Asp Tyr Phe Asp His His Asp Ile Ile Gly Trp Thr Arg Glu
405 410 415

gga aat agc tcc cat cca aat tca ggc ctt gcc acc att atg tca gat 1296
Gly Asn Ser Ser His Pro Asn Ser Gly Leu Ala Thr Ile Met Ser Asp
420 425 430

ggt cca ggt ggt aac aaa tgg atg tat gtg ggg aaa aat aaa gcg gga 1344
Gly Pro Gly Gly Asn Lys Trp Met Tyr Val Gly Lys Asn Lys Ala Gly
435 440 445

caa gtt tgg aga gat att acc gga aat agg aca ggc acc gtc aca att 1392
Gln Val Trp Arg Asp Ile Thr Gly Asn Arg Thr Gly Thr Val Thr Ile
450 455 460

aat gca gac gga tgg ggt aat ttc tct gtt aat gga ggg tcc gtt tcg 1440
Asn Ala Asp Gly Trp Gly Asn Phe Ser Val Asn Gly Gly Ser Val Ser
465 470 475 480

gtt tgg gtg aag caa 1455
Val Trp Val Lys Gln
485

<210> 2 
<211> 485 
<212> PRT 
<213> Baeillus sp. 

<400> 2 

His His Asn Gly Thr Asn Gly Thr Met Met Gin Tyr Phe Glu Trp Tyr 
15 10 15 

20 Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ala 
20 25 30 



10 



15 



25 



40 



55 



Asn Leu Lys Ser Lys Gly lie Thr Ala Val Trp He Pro Pro Ala Trp 
35 40 45 

Lys Gly Thr Ser Gin Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 
50 55 60 



Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly 
30 65 70 75 80 

Thr Arg Asn Gin Leu Gin Ala Ala Val Thr Ser Leu Lys Asn Asn Gly 
85 90 95 

35 He Gin Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp 

100 10 5 : ^ 110 



Gly Thr Glu He Val Asn Ala Val Glu Val Asn Arg Ser Asn Arg Asn 
115 120 125 

Gin Glu Thr Ser Gly Glu Tyr Ala He Glu Ala Trp Thr Lys Phe Asp 
130 135 140 



Phe Pro Gly Arg Gly Asn Asn His Ser Ser Phe Lys Trp Arg Trp Tyr 
45 145 150 155 160 

His Phe Asp Gly Thr Asp Trp Asp Gin Ser Arg Gin Leu Gin Asn Lys 
165 170 175 

50 He Tyr Lys Phe Arg Gly Thr Gly Lys Ala Trp Asp Trp Glu Val Asp 
180 185 190 



Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met 
195 200 205 

Asp His Pro Glu Val He His Glu Leu Arg Asn Trp Gly Val Trp Tyr 
210 215 220 



Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg He Asp Ala Val Lys His 
60 225 230 235 240 

He Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Thr 
245 250 255 

65 Thr Gly Lys Pro Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 
260 265 270 

Gly Ala He Glu Asn Tyr Leu Asn Lys Thr Ser Trp Asn His Ser Val 
275 280 285 
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10 



25 



Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly
290 295 300

Gly Tyr Tyr Asp Met Arg Asn Ile Leu Asn Gly Ser Val Val Gln Lys
305 310 315 320

His Pro Thr His Ala Val Thr Phe Val Asp Asn His Asp Ser Gln Pro
325 330 335

Gly Glu Ala Leu Glu Ser Phe Val Gln Gln Trp Phe Lys Pro Leu Ala
340 345 350

Tyr Ala Leu Val Leu Thr Arg Glu Gln Gly Tyr Pro Ser Val Phe Tyr
355 360 365

Gly Asp Tyr Tyr Gly Ile Pro Thr His Gly Val Pro Ala Met Lys Ser
370 375 380

Lys Ile Asp Pro Leu Leu Gln Ala Arg Gln Thr Phe Ala Tyr Gly Thr
385 390 395 400



Gln His Asp Tyr Phe Asp His His Asp Ile Ile Gly Trp Thr Arg Glu
405 410 415 

Gly Asn Ser Ser His Pro Asn Ser Gly Leu Ala Thr Ile Met Ser Asp
420 425 430

Gly Pro Gly Gly Asn Lys Trp Met Tyr Val Gly Lys Asn Lys Ala Gly
435 440 445

Gln Val Trp Arg Asp Ile Thr Gly Asn Arg Thr Gly Thr Val Thr Ile
450 455 460

Asn Ala Asp Gly Trp Gly Asn Phe Ser Val Asn Gly Gly Ser Val Ser
465 470 475 480

Val Trp Val Lys Gln
485






<210> 3 
<211> 1455 
<212> DNA
<213> Bacillus sp.

<220>
<221> CDS
<222> (1)..(1455)
<223> SP722 

<400> 3 

cat cat aat ggg aca aat ggg acg atg atg caa tac ttt gaa tgg cac 48
His His Asn Gly Thr Asn Gly Thr Met Met Gln Tyr Phe Glu Trp His
1 5 10 15

ttg cct aat gat ggg aat cac tgg aat aga tta aga gat gat gct agt 96
Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ser
20 25 30

aat cta aga aat aga ggt ata acc gct att tgg att ccg cct gcc tgg 144
Asn Leu Arg Asn Arg Gly Ile Thr Ala Ile Trp Ile Pro Pro Ala Trp
35 40 45

aaa ggg act tcg caa aat gat gtg ggg tat gga gcc tat gat ctt tat 192
Lys Gly Thr Ser Gln Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr
50 55 60

gat tta ggg gaa ttt aat caa aag ggg acg gtt cgt act aag tat ggg 240
Asp Leu Gly Glu Phe Asn Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly
65 70 75 80

aca cgt agt caa ttg gag tct gcc atc cat gct tta aag aat aat ggc 288
Thr Arg Ser Gln Leu Glu Ser Ala Ile His Ala Leu Lys Asn Asn Gly
85 90 95

gtt caa gtt tat ggg gat gta gtg atg aac cat aaa gga gga gct gat 336
Val Gln Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp
100 105 110

gct aca gaa aac gtt ctt gct gtc gag gtg aat cca aat aac cgg aat 384
Ala Thr Glu Asn Val Leu Ala Val Glu Val Asn Pro Asn Asn Arg Asn
115 120 125

caa gaa ata tct ggg gac tac aca att gag gct tgg act aag ttt gat 432
Gln Glu Ile Ser Gly Asp Tyr Thr Ile Glu Ala Trp Thr Lys Phe Asp
130 135 140






ttt cca ggg agg ggt aat aca tac tca gac ttt aaa tgg cgt tgg tat 480
Phe Pro Gly Arg Gly Asn Thr Tyr Ser Asp Phe Lys Trp Arg Trp Tyr
145 150 155 160

cat ttc gat ggt gta gat tgg gat caa tca cga caa ttc caa aat cgt 528
His Phe Asp Gly Val Asp Trp Asp Gln Ser Arg Gln Phe Gln Asn Arg
165 170 175

atc tac aaa ttc cga ggt gat ggt aag gca tgg gat tgg gaa gta gat 576
Ile Tyr Lys Phe Arg Gly Asp Gly Lys Ala Trp Asp Trp Glu Val Asp
180 185 190

tcg gaa aat gga aat tat gat tat tta atg tat gca gat gta gat atg 624
Ser Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met
195 200 205

gat cat ccg gag gta gta aat gag ctt aga aga tgg gga gaa tgg tat 672
Asp His Pro Glu Val Val Asn Glu Leu Arg Arg Trp Gly Glu Trp Tyr
210 215 220

aca aat aca tta aat ctt gat gga ttt agg atc gat gcg gtg aag cat 720
Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg Ile Asp Ala Val Lys His
225 230 235 240

att aaa tat agc ttt aca cgt gat tgg ttg acc cat gta aga aac gca 768
Ile Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Ala
245 250 255

acg gga aaa gaa atg ttt gct gtt gct gaa ttt tgg aaa aat gat tta 816
Thr Gly Lys Glu Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu
260 265 270

ggt gcc ttg gag aac tat tta aat aaa aca aac tgg aat cat tct gtc 864
Gly Ala Leu Glu Asn Tyr Leu Asn Lys Thr Asn Trp Asn His Ser Val
275 280 285

ttt gat gtc ccc ctt cat tat aat ctt tat aac gcg tca aat agt gga 912
Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly
290 295 300

ggc aac tat gac atg gca aaa ctt ctt aat gga acg gtt gtt caa aag 960
Gly Asn Tyr Asp Met Ala Lys Leu Leu Asn Gly Thr Val Val Gln Lys
305 310 315 320

cat cca atg cat gcc gta act ttt gtg gat aat cac gat tct caa cct 1008
His Pro Met His Ala Val Thr Phe Val Asp Asn His Asp Ser Gln Pro
325 330 335

ggg gaa tca tta gaa tca ttt gta caa gaa tgg ttt aag cca ctt gct 1056
Gly Glu Ser Leu Glu Ser Phe Val Gln Glu Trp Phe Lys Pro Leu Ala
340 345 350

tat gcg ctt att tta aca aga gaa caa ggc tat ccc tct gtc ttc tat 1104
Tyr Ala Leu Ile Leu Thr Arg Glu Gln Gly Tyr Pro Ser Val Phe Tyr
355 360 365

ggt gac tac tat gga att cca aca cat agt gtc cca gca atg aaa gcc 1152
Gly Asp Tyr Tyr Gly Ile Pro Thr His Ser Val Pro Ala Met Lys Ala
370 375 380

aag att gat cca atc tta gag gcg cgt caa aat ttt gca tat gga aca 1200
Lys Ile Asp Pro Ile Leu Glu Ala Arg Gln Asn Phe Ala Tyr Gly Thr
385 390 395 400

caa cat gat tat ttt gac cat cat aat ata atc gga tgg aca cgt gaa 1248
Gln His Asp Tyr Phe Asp His His Asn Ile Ile Gly Trp Thr Arg Glu
405 410 415

gga aat acc acg cat ccc aat tca gga ctt gcg act atc atg tcg gat 1296
Gly Asn Thr Thr His Pro Asn Ser Gly Leu Ala Thr Ile Met Ser Asp
420 425 430

ggg cca ggg gga gag aaa tgg atg tac gta ggg caa aat aaa gca ggt 1344
Gly Pro Gly Gly Glu Lys Trp Met Tyr Val Gly Gln Asn Lys Ala Gly
435 440 445

caa gtt tgg cat gac ata act gga aat aaa cca gga aca gtt acg atc 1392
Gln Val Trp His Asp Ile Thr Gly Asn Lys Pro Gly Thr Val Thr Ile
450 455 460

aat gca gat gga tgg gct aat ttt tca gta aat gga gga tct gtt tcc 1440
Asn Ala Asp Gly Trp Ala Asn Phe Ser Val Asn Gly Gly Ser Val Ser
465 470 475 480

att tgg gtg aaa cga 1455
Ile Trp Val Lys Arg
485

<210> 4 
<211> 485 
<212> PRT 
<213> Bacillus sp. 

<400> 4 

His His Asn Gly Thr Asn Gly Thr Met Met Gln Tyr Phe Glu Trp His
1 5 10 15

Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ser
20 25 30






Asn Leu Arg Asn Arg Gly Ile Thr Ala Ile Trp Ile Pro Pro Ala Trp
35 40 45

Lys Gly Thr Ser Gln Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr
50 55 60

Asp Leu Gly Glu Phe Asn Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly
65 70 75 80

Thr Arg Ser Gln Leu Glu Ser Ala Ile His Ala Leu Lys Asn Asn Gly
85 90 95

Val Gln Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp
100 105 110

Ala Thr Glu Asn Val Leu Ala Val Glu Val Asn Pro Asn Asn Arg Asn
115 120 125

Gln Glu Ile Ser Gly Asp Tyr Thr Ile Glu Ala Trp Thr Lys Phe Asp
130 135 140

Phe Pro Gly Arg Gly Asn Thr Tyr Ser Asp Phe Lys Trp Arg Trp Tyr
145 150 155 160

His Phe Asp Gly Val Asp Trp Asp Gln Ser Arg Gln Phe Gln Asn Arg
165 170 175

Ile Tyr Lys Phe Arg Gly Asp Gly Lys Ala Trp Asp Trp Glu Val Asp
180 185 190

Ser Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met
195 200 205

Asp His Pro Glu Val Val Asn Glu Leu Arg Arg Trp Gly Glu Trp Tyr
210 215 220

Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg Ile Asp Ala Val Lys His
225 230 235 240






Ile Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Ala
245 250 255

Thr Gly Lys Glu Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu
260 265 270

Gly Ala Leu Glu Asn Tyr Leu Asn Lys Thr Asn Trp Asn His Ser Val
275 280 285

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly
290 295 300

Gly Asn Tyr Asp Met Ala Lys Leu Leu Asn Gly Thr Val Val Gln Lys
305 310 315 320

His Pro Met His Ala Val Thr Phe Val Asp Asn His Asp Ser Gln Pro
325 330 335

Gly Glu Ser Leu Glu Ser Phe Val Gln Glu Trp Phe Lys Pro Leu Ala
340 345 350

Tyr Ala Leu Ile Leu Thr Arg Glu Gln Gly Tyr Pro Ser Val Phe Tyr
355 360 365

Gly Asp Tyr Tyr Gly Ile Pro Thr His Ser Val Pro Ala Met Lys Ala
370 375 380

Lys Ile Asp Pro Ile Leu Glu Ala Arg Gln Asn Phe Ala Tyr Gly Thr
385 390 395 400

Gln His Asp Tyr Phe Asp His His Asn Ile Ile Gly Trp Thr Arg Glu
405 410 415

Gly Asn Thr Thr His Pro Asn Ser Gly Leu Ala Thr Ile Met Ser Asp
420 425 430

Gly Pro Gly Gly Glu Lys Trp Met Tyr Val Gly Gln Asn Lys Ala Gly
435 440 445

Gln Val Trp His Asp Ile Thr Gly Asn Lys Pro Gly Thr Val Thr Ile
450 455 460

Asn Ala Asp Gly Trp Ala Asn Phe Ser Val Asn Gly Gly Ser Val Ser
465 470 475 480

Ile Trp Val Lys Arg
485

<210> 5
<211> 1548
<212> DNA
<213> Bacillus stearothermophilus

<220>
<221> CDS
<222> (1)..(1548)
<223> BSG

<400> 5

gcc gca ccg ttt aac ggc acc atg atg cag tat ttt gaa tgg tac ttg
Ala Ala Pro Phe Asn Gly Thr Met Met Gln Tyr Phe Glu Trp Tyr Leu
1 5 10 15

ccg gat gat ggc acg tta tgg acc aaa gtg gcc aat gaa gcc aac aac
Pro Asp Asp Gly Thr Leu Trp Thr Lys Val Ala Asn Glu Ala Asn Asn
20 25 30

tta tcc agc ctt ggc atc acc gct ctt tgg ctg ccg ccc gct tac aaa
Leu Ser Ser Leu Gly Ile Thr Ala Leu Trp Leu Pro Pro Ala Tyr Lys
35 40 45

gga aca agc cgc agc gac gta ggg tac gga gta tac gac ttg tat gac
Gly Thr Ser Arg Ser Asp Val Gly Tyr Gly Val Tyr Asp Leu Tyr Asp
50 55 60

ctc ggc gaa ttc aat caa aaa ggg acc gtc cgc aca aaa tac gga aca
Leu Gly Glu Phe Asn Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr
65 70 75 80

aaa gct caa tat ctt caa gcc att caa gcc gcc cac gcc gct gga atg
Lys Ala Gln Tyr Leu Gln Ala Ile Gln Ala Ala His Ala Ala Gly Met
85 90 95

caa gtg tac gcc gat gtc gtg ttc gac cat aaa ggc ggc gct gac ggc
Gln Val Tyr Ala Asp Val Val Phe Asp His Lys Gly Gly Ala Asp Gly
100 105 110

acg gaa tgg gtg gac gcc gtc gaa gtc aat ccg tcc gac cgc aac caa
Thr Glu Trp Val Asp Ala Val Glu Val Asn Pro Ser Asp Arg Asn Gln
115 120 125

gaa atc tcg ggc acc tat caa atc caa gca tgg acg aaa ttt gat ttt
Glu Ile Ser Gly Thr Tyr Gln Ile Gln Ala Trp Thr Lys Phe Asp Phe
130 135 140

ccc ggg cgg ggc aac acc tac tcc agc ttt aag tgg cgc tgg tac cat
Pro Gly Arg Gly Asn Thr Tyr Ser Ser Phe Lys Trp Arg Trp Tyr His
145 150 155 160

ttt gac ggc gtt gat tgg gac gaa agc cga aaa ttg agc cgc att tac
Phe Asp Gly Val Asp Trp Asp Glu Ser Arg Lys Leu Ser Arg Ile Tyr
165 170 175

aaa ttc cgc ggc atc ggc aaa gcg tgg gat tgg gaa gta gac acg gaa
Lys Phe Arg Gly Ile Gly Lys Ala Trp Asp Trp Glu Val Asp Thr Glu
180 185 190

aac gga aac tat gac tac tta atg tat gcc gac ctt gat atg gat cat
Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met Asp His
195 200 205

ccc gaa gtc gtg acc gag ctg aaa aac tgg ggg aaa tgg tat gtc aac
Pro Glu Val Val Thr Glu Leu Lys Asn Trp Gly Lys Trp Tyr Val Asn
210 215 220

aca acg aac att gat ggg ttc cgg ctt gat gcc gtc aag cat att aag 720
Thr Thr Asn Ile Asp Gly Phe Arg Leu Asp Ala Val Lys His Ile Lys
225 230 235 240

ttc agt ttt ttt cct gat tgg ttg tcg tat gtg cgt tct cag act ggc 768
Phe Ser Phe Phe Pro Asp Trp Leu Ser Tyr Val Arg Ser Gln Thr Gly
245 250 255

aag ccg cta ttt acc gtc ggg gaa tat tgg agc tat gac atc aac aag 816
Lys Pro Leu Phe Thr Val Gly Glu Tyr Trp Ser Tyr Asp Ile Asn Lys
260 265 270

ttg cac aat tac att acg aaa aca gac gga acg atg tct ttg ttt gat 864
Leu His Asn Tyr Ile Thr Lys Thr Asp Gly Thr Met Ser Leu Phe Asp
275 280 285

gcc ccg tta cac aac aaa ttt tat acc gct tcc aaa tca ggg ggc gca 912
Ala Pro Leu His Asn Lys Phe Tyr Thr Ala Ser Lys Ser Gly Gly Ala
290 295 300

ttt gat atg cgc acg tta atg acc aat act ctc atg aaa gat caa ccg 960
Phe Asp Met Arg Thr Leu Met Thr Asn Thr Leu Met Lys Asp Gln Pro
305 310 315 320

aca ttg gcc gtc acc ttc gtt gat aat cat gac acc gaa ccc ggc caa 1008
Thr Leu Ala Val Thr Phe Val Asp Asn His Asp Thr Glu Pro Gly Gln
325 330 335

gcg ctg cag tca tgg gtc gac cca tgg ttc aaa ccg ttg gct tac gcc 1056
Ala Leu Gln Ser Trp Val Asp Pro Trp Phe Lys Pro Leu Ala Tyr Ala
340 345 350

ttt att cta act cgg cag gaa gga tac ccg tgc gtc ttt tat ggt gac 1104
Phe Ile Leu Thr Arg Gln Glu Gly Tyr Pro Cys Val Phe Tyr Gly Asp
355 360 365

tat tat ggc att cca caa tat aac att cct tcg ctg aaa agc aaa atc 1152
Tyr Tyr Gly Ile Pro Gln Tyr Asn Ile Pro Ser Leu Lys Ser Lys Ile
370 375 380

gat ccg ctc ctc atc gcg cgc agg gat tat gct tac gga acg caa cat 1200
Asp Pro Leu Leu Ile Ala Arg Arg Asp Tyr Ala Tyr Gly Thr Gln His
385 390 395 400

gat tat ctt gat cac tcc gac atc atc ggg tgg aca agg gaa ggg ggc 1248
Asp Tyr Leu Asp His Ser Asp Ile Ile Gly Trp Thr Arg Glu Gly Gly
405 410 415

act gaa aaa cca gga tcc gga ctg gcc gca ctg atc acc gat ggg ccg 1296
Thr Glu Lys Pro Gly Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly Pro
420 425 430

gga gga agc aaa tgg atg tac gtt ggc aaa caa cac gct gga aaa gtg 1344
Gly Gly Ser Lys Trp Met Tyr Val Gly Lys Gln His Ala Gly Lys Val
435 440 445

ttc tat gac ctt acc ggc aac cgg agt gac acc gtc acc atc aac agt 1392
Phe Tyr Asp Leu Thr Gly Asn Arg Ser Asp Thr Val Thr Ile Asn Ser
450 455 460

gat gga tgg ggg gaa ttc aaa gtc aat ggc ggt tcg gtt tcg gtt tgg 1440
Asp Gly Trp Gly Glu Phe Lys Val Asn Gly Gly Ser Val Ser Val Trp
465 470 475 480

gtt cct aga aaa acg acc gtt tct acc atc gct cgg ccg atc aca acc 1488
Val Pro Arg Lys Thr Thr Val Ser Thr Ile Ala Arg Pro Ile Thr Thr
485 490 495

cga ccg tgg act ggt gaa ttc gtc cgt tgg acc gaa cca cgg ttg gtg 1536
Arg Pro Trp Thr Gly Glu Phe Val Arg Trp Thr Glu Pro Arg Leu Val
500 505 510

gca tgg cct tga 1548
Ala Trp Pro
515

<210> 6
<211> 515
<212> PRT
<213> Bacillus stearothermophilus

<400> 6

Ala Ala Pro Phe Asn Gly Thr Met Met Gln Tyr Phe Glu Trp Tyr Leu
1 5 10 15

Pro Asp Asp Gly Thr Leu Trp Thr Lys Val Ala Asn Glu Ala Asn Asn
20 25 30

Leu Ser Ser Leu Gly Ile Thr Ala Leu Trp Leu Pro Pro Ala Tyr Lys
35 40 45

Gly Thr Ser Arg Ser Asp Val Gly Tyr Gly Val Tyr Asp Leu Tyr Asp
50 55 60






Leu Gly Glu Phe Asn Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr
65 70 75 80

Lys Ala Gln Tyr Leu Gln Ala Ile Gln Ala Ala His Ala Ala Gly Met
85 90 95

Gln Val Tyr Ala Asp Val Val Phe Asp His Lys Gly Gly Ala Asp Gly
100 105 110

Thr Glu Trp Val Asp Ala Val Glu Val Asn Pro Ser Asp Arg Asn Gln
115 120 125

Glu Ile Ser Gly Thr Tyr Gln Ile Gln Ala Trp Thr Lys Phe Asp Phe
130 135 140

Pro Gly Arg Gly Asn Thr Tyr Ser Ser Phe Lys Trp Arg Trp Tyr His
145 150 155 160

Phe Asp Gly Val Asp Trp Asp Glu Ser Arg Lys Leu Ser Arg Ile Tyr
165 170 175

Lys Phe Arg Gly Ile Gly Lys Ala Trp Asp Trp Glu Val Asp Thr Glu
180 185 190

Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met Asp His
195 200 205

Pro Glu Val Val Thr Glu Leu Lys Asn Trp Gly Lys Trp Tyr Val Asn
210 215 220

Thr Thr Asn Ile Asp Gly Phe Arg Leu Asp Ala Val Lys His Ile Lys
225 230 235 240

Phe Ser Phe Phe Pro Asp Trp Leu Ser Tyr Val Arg Ser Gln Thr Gly
245 250 255

Lys Pro Leu Phe Thr Val Gly Glu Tyr Trp Ser Tyr Asp Ile Asn Lys
260 265 270

Leu His Asn Tyr Ile Thr Lys Thr Asp Gly Thr Met Ser Leu Phe Asp
275 280 285

Ala Pro Leu His Asn Lys Phe Tyr Thr Ala Ser Lys Ser Gly Gly Ala
290 295 300

Phe Asp Met Arg Thr Leu Met Thr Asn Thr Leu Met Lys Asp Gln Pro
305 310 315 320

Thr Leu Ala Val Thr Phe Val Asp Asn His Asp Thr Glu Pro Gly Gln
325 330 335

Ala Leu Gln Ser Trp Val Asp Pro Trp Phe Lys Pro Leu Ala Tyr Ala
340 345 350






Phe Ile Leu Thr Arg Gln Glu Gly Tyr Pro Cys Val Phe Tyr Gly Asp
355 360 365

Tyr Tyr Gly Ile Pro Gln Tyr Asn Ile Pro Ser Leu Lys Ser Lys Ile
370 375 380

Asp Pro Leu Leu Ile Ala Arg Arg Asp Tyr Ala Tyr Gly Thr Gln His
385 390 395 400

Asp Tyr Leu Asp His Ser Asp Ile Ile Gly Trp Thr Arg Glu Gly Gly
405 410 415

Thr Glu Lys Pro Gly Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly Pro
420 425 430

Gly Gly Ser Lys Trp Met Tyr Val Gly Lys Gln His Ala Gly Lys Val
435 440 445

Phe Tyr Asp Leu Thr Gly Asn Arg Ser Asp Thr Val Thr Ile Asn Ser
450 455 460

Asp Gly Trp Gly Glu Phe Lys Val Asn Gly Gly Ser Val Ser Val Trp
465 470 475 480

Val Pro Arg Lys Thr Thr Val Ser Thr Ile Ala Arg Pro Ile Thr Thr
485 490 495

Arg Pro Trp Thr Gly Glu Phe Val Arg Trp Thr Glu Pro Arg Leu Val
500 505 510

Ala Trp Pro
515



<210> 7
<211> 1920
<212> DNA
<213> Bacillus licheniformis

<220>
<221> CDS
<222> (421)..(1872)
<223> Termamyl

<400> 7

cggaagattg gaagtacaaa aataagcaaa agattgtcaa tcatgtcatg agccatgcgg 60 



gagacggaaa aatcgtctta atgcacgata tttatgcaac gttcgcagat gctgctgaag 120 

agattattaa aaagctgaaa gcaaaaggct atcaattggt aactgtatct cagcttgaag 180 

aagtgaagaa gcagagaggc tattgaataa atgagtagaa gcgccatatc ggcgcttttc 240

ttttggaaga aaatataggg aaaatggtac ttgttaaaaa ttcggaatat ttatacaaca 300 

tcatatgttt cacattgaaa ggggaggaga atcatgaaac aacaaaaacg gctttacgcc 360 
cgattgctga cgctgttatt tgcgctcatc ttcttgctgc ctcattctgc agcagcggcg 420

gca aat ctt aat ggg acg ctg atg cag tat ttt gaa tgg tac atg ccc 468
Ala Asn Leu Asn Gly Thr Leu Met Gln Tyr Phe Glu Trp Tyr Met Pro
1 5 10 15

aat gac ggc caa cat tgg agg cgt ttg caa aac gac tcg gca tat ttg 516
Asn Asp Gly Gln His Trp Arg Arg Leu Gln Asn Asp Ser Ala Tyr Leu
20 25 30

gct gaa cac ggt att act gcc gtc tgg att ccc ccg gca tat aag gga 564
Ala Glu His Gly Ile Thr Ala Val Trp Ile Pro Pro Ala Tyr Lys Gly
35 40 45

acg agc caa gcg gat gtg ggc tac ggt gct tac gac ctt tat gat tta 612
Thr Ser Gln Ala Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr Asp Leu
50 55 60

ggg gag ttt cat caa aaa ggg acg gtt cgg aca aag tac ggc aca aaa 660
Gly Glu Phe His Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys
65 70 75 80

gga gag ctg caa tct gcg atc aaa agt ctt cat tcc cgc gac att aac 708
Gly Glu Leu Gln Ser Ala Ile Lys Ser Leu His Ser Arg Asp Ile Asn
85 90 95

gtt tac ggg gat gtg gtc atc aac cac aaa ggc ggc gct gat gcg acc 756
Val Tyr Gly Asp Val Val Ile Asn His Lys Gly Gly Ala Asp Ala Thr
100 105 110

gaa gat gta acc gcg gtt gaa gtc gat ccc gct gac cgc aac cgc gta 804
Glu Asp Val Thr Ala Val Glu Val Asp Pro Ala Asp Arg Asn Arg Val
115 120 125

att tca gga gaa cac cta att aaa gcc tgg aca cat ttt cat ttt ccg 852
Ile Ser Gly Glu His Leu Ile Lys Ala Trp Thr His Phe His Phe Pro
130 135 140

ggg cgc ggc agc aca tac agc gat ttt aaa tgg cat tgg tac cat ttt 900
Gly Arg Gly Ser Thr Tyr Ser Asp Phe Lys Trp His Trp Tyr His Phe
145 150 155 160

gac gga acc gat tgg gac gag tcc cga aag ctg aac cgc atc tat aag 948
Asp Gly Thr Asp Trp Asp Glu Ser Arg Lys Leu Asn Arg Ile Tyr Lys
165 170 175

ttt caa gga aag gct tgg gat tgg gaa gtt tcc aat gaa aac ggc aac 996
Phe Gln Gly Lys Ala Trp Asp Trp Glu Val Ser Asn Glu Asn Gly Asn
180 185 190

tat gat tat ttg atg tat gcc gac atc gat tat gac cat cct gat gtc 1044
Tyr Asp Tyr Leu Met Tyr Ala Asp Ile Asp Tyr Asp His Pro Asp Val
195 200 205

gca gca gaa att aag aga tgg ggc act tgg tat gcc aat gaa ctg caa 1092
Ala Ala Glu Ile Lys Arg Trp Gly Thr Trp Tyr Ala Asn Glu Leu Gln
210 215 220

ttg gac ggt ttc cgt ctt gat gct gtc aaa cac att aaa ttt tct ttt 1140
Leu Asp Gly Phe Arg Leu Asp Ala Val Lys His Ile Lys Phe Ser Phe
225 230 235 240

ttg cgg gat tgg gtt aat cat gtc agg gaa aaa acg ggg aag gaa atg 1188
Leu Arg Asp Trp Val Asn His Val Arg Glu Lys Thr Gly Lys Glu Met
245 250 255

ttt acg gta gct gaa tat tgg cag aat gac ttg ggc gcg ctg gaa aac 1236
Phe Thr Val Ala Glu Tyr Trp Gln Asn Asp Leu Gly Ala Leu Glu Asn
260 265 270

tat ttg aac aaa aca aat ttt aat cat tca gtg ttt gac gtg ccg ctt 1284
Tyr Leu Asn Lys Thr Asn Phe Asn His Ser Val Phe Asp Val Pro Leu
275 280 285

cat tat cag ttc cat gct gca tcg aca cag gga ggc ggc tat gat atg 1332
His Tyr Gln Phe His Ala Ala Ser Thr Gln Gly Gly Gly Tyr Asp Met
290 295 300

agg aaa ttg ctg aac ggt acg gtc gtt tcc aag cat ccg ttg aaa tcg 1380
Arg Lys Leu Leu Asn Gly Thr Val Val Ser Lys His Pro Leu Lys Ser
305 310 315 320

gtt aca ttt gtc gat aac cat gat aca cag ccg ggg caa tcg ctt gag 1428
Val Thr Phe Val Asp Asn His Asp Thr Gln Pro Gly Gln Ser Leu Glu
325 330 335

tcg act gtc caa aca tgg ttt aag ccg ctt gct tac gct ttt att ctc 1476
Ser Thr Val Gln Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe Ile Leu
340 345 350

aca agg gaa tct gga tac cct cag gtt ttc tac ggg gat atg tac ggg 1524
Thr Arg Glu Ser Gly Tyr Pro Gln Val Phe Tyr Gly Asp Met Tyr Gly
355 360 365

acg aaa gga gac tcc cag cgc gaa att cct gcc ttg aaa cac aaa att 1572
Thr Lys Gly Asp Ser Gln Arg Glu Ile Pro Ala Leu Lys His Lys Ile
370 375 380

gaa ccg atc tta aaa gcg aga aaa cag tat gcg tac gga gca cag cat 1620
Glu Pro Ile Leu Lys Ala Arg Lys Gln Tyr Ala Tyr Gly Ala Gln His
385 390 395 400

gat tat ttc gac cac cat gac att gtc ggc tgg aca agg gaa ggc gac 1668
Asp Tyr Phe Asp His His Asp Ile Val Gly Trp Thr Arg Glu Gly Asp
405 410 415

agc tcg gtt gca aat tca ggt ttg gcg gca tta ata aca gac gga ccc 1716
Ser Ser Val Ala Asn Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly Pro
420 425 430

ggt ggg gca aag cga atg tat gtc ggc cgg caa aac gcc ggt gag aca 1764
Gly Gly Ala Lys Arg Met Tyr Val Gly Arg Gln Asn Ala Gly Glu Thr
435 440 445

tgg cat gac att acc gga aac cgt tcg gag ccg gtt gtc atc aat tcg 1812
Trp His Asp Ile Thr Gly Asn Arg Ser Glu Pro Val Val Ile Asn Ser
450 455 460

gaa ggc tgg gga gag ttt cac gta aac ggc ggg tcg gtt tca att tat 1860
Glu Gly Trp Gly Glu Phe His Val Asn Gly Gly Ser Val Ser Ile Tyr
465 470 475 480

gtt caa aga tag aagagcagag aggacggatt tcctgaagga aatccgtttt 1912
Val Gln Arg



tttatttt 1920 

<210> 8
<211> 483
<212> PRT
<213> Bacillus licheniformis

<400> 8

Ala Asn Leu Asn Gly Thr Leu Met Gln Tyr Phe Glu Trp Tyr Met Pro
1 5 10 15

Asn Asp Gly Gln His Trp Arg Arg Leu Gln Asn Asp Ser Ala Tyr Leu
20 25 30

Ala Glu His Gly Ile Thr Ala Val Trp Ile Pro Pro Ala Tyr Lys Gly
35 40 45

Thr Ser Gln Ala Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr Asp Leu
50 55 60

Gly Glu Phe His Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys
65 70 75 80






Gly Glu Leu Gln Ser Ala Ile Lys Ser Leu His Ser Arg Asp Ile Asn
85 90 95

Val Tyr Gly Asp Val Val Ile Asn His Lys Gly Gly Ala Asp Ala Thr
100 105 110

Glu Asp Val Thr Ala Val Glu Val Asp Pro Ala Asp Arg Asn Arg Val
115 120 125

Ile Ser Gly Glu His Leu Ile Lys Ala Trp Thr His Phe His Phe Pro
130 135 140

Gly Arg Gly Ser Thr Tyr Ser Asp Phe Lys Trp His Trp Tyr His Phe
145 150 155 160

Asp Gly Thr Asp Trp Asp Glu Ser Arg Lys Leu Asn Arg Ile Tyr Lys
165 170 175

Phe Gln Gly Lys Ala Trp Asp Trp Glu Val Ser Asn Glu Asn Gly Asn
180 185 190

Tyr Asp Tyr Leu Met Tyr Ala Asp Ile Asp Tyr Asp His Pro Asp Val
195 200 205

Ala Ala Glu Ile Lys Arg Trp Gly Thr Trp Tyr Ala Asn Glu Leu Gln
210 215 220

Leu Asp Gly Phe Arg Leu Asp Ala Val Lys His Ile Lys Phe Ser Phe
225 230 235 240

Leu Arg Asp Trp Val Asn His Val Arg Glu Lys Thr Gly Lys Glu Met
245 250 255

Phe Thr Val Ala Glu Tyr Trp Gln Asn Asp Leu Gly Ala Leu Glu Asn
260 265 270

Tyr Leu Asn Lys Thr Asn Phe Asn His Ser Val Phe Asp Val Pro Leu
275 280 285

His Tyr Gln Phe His Ala Ala Ser Thr Gln Gly Gly Gly Tyr Asp Met
290 295 300

Arg Lys Leu Leu Asn Gly Thr Val Val Ser Lys His Pro Leu Lys Ser
305 310 315 320

Val Thr Phe Val Asp Asn His Asp Thr Gln Pro Gly Gln Ser Leu Glu
325 330 335

Ser Thr Val Gln Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe Ile Leu
340 345 350

Thr Arg Glu Ser Gly Tyr Pro Gln Val Phe Tyr Gly Asp Met Tyr Gly
355 360 365

Thr Lys Gly Asp Ser Gln Arg Glu Ile Pro Ala Leu Lys His Lys Ile
370 375 380

Glu Pro Ile Leu Lys Ala Arg Lys Gln Tyr Ala Tyr Gly Ala Gln His
385 390 395 400

Asp Tyr Phe Asp His His Asp Ile Val Gly Trp Thr Arg Glu Gly Asp
405 410 415

Ser Ser Val Ala Asn Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly Pro
420 425 430

Gly Gly Ala Lys Arg Met Tyr Val Gly Arg Gln Asn Ala Gly Glu Thr
435 440 445






Trp His Asp Ile Thr Gly Asn Arg Ser Glu Pro Val Val Ile Asn Ser
450 455 460

Glu Gly Trp Gly Glu Phe His Val Asn Gly Gly Ser Val Ser Ile Tyr
465 470 475 480

Val Gln Arg



<210> 9
<211> 2084
<212> DNA
<213> Bacillus amyloliquefaciens

<220>
<221> CDS
<222> (343)..(1794)
<223> BAN

<400> 9

gccccgcaca tacgaaaaga ctggctgaaa acattgagcc tttgatgact gatgatttgg 60

ctgaagaagt ggatcgattg tttgagaaaa gaagaagacc ataaaaatac cttgtctgtc 120

atcagacagg gtatttttta tgctgtccag actgtccgct gtgtaaaaat aaggaataaa 180

ggggggttgt tattatttta ctgatatgta aaatataatt tgtataagaa aatgagaggg 240

agaggaaaca tgattcaaaa acgaaagcgg acagtttcgt tcagacttgt gcttatgtgc 300

acgctgttat ttgtcagttt gccgattaca aaaacatcag cc gta aat ggc acg 354
Val Asn Gly Thr
1

ctg atg cag tat ttt gaa tgg tat acg ccg aac gac ggc cag cat tgg 402
Leu Met Gln Tyr Phe Glu Trp Tyr Thr Pro Asn Asp Gly Gln His Trp
5 10 15 20

aaa cga ttg cag aat gat gcg gaa cat tta tcg gat atc gga atc act 450
Lys Arg Leu Gln Asn Asp Ala Glu His Leu Ser Asp Ile Gly Ile Thr
25 30 35

gcc gtc tgg att cct ccc gca tac aaa gga ttg agc caa tcc gat aac 498
Ala Val Trp Ile Pro Pro Ala Tyr Lys Gly Leu Ser Gln Ser Asp Asn
40 45 50

gga tac gga cct tat gat ttg tat gat tta gga gaa ttc cag caa aaa 546
Gly Tyr Gly Pro Tyr Asp Leu Tyr Asp Leu Gly Glu Phe Gln Gln Lys
55 60 65

ggg acg gtc aga acg aaa tac ggc aca aaa tca gag ctt caa gat gcg 594
Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys Ser Glu Leu Gln Asp Ala
70 75 80

atc ggc tca ctg cat tcc cgg aac gtc caa gta tac gga gat gtg gtt 642
Ile Gly Ser Leu His Ser Arg Asn Val Gln Val Tyr Gly Asp Val Val
85 90 95 100

ttg aat cat aag gct ggt gct gat gca aca gaa gat gta act gcc gtc 690
Leu Asn His Lys Ala Gly Ala Asp Ala Thr Glu Asp Val Thr Ala Val
105 110 115

gaa gtc aat ccg gcc aat aga aat cag gaa act tcg gag gaa tat caa 738
Glu Val Asn Pro Ala Asn Arg Asn Gln Glu Thr Ser Glu Glu Tyr Gln
120 125 130

atc aaa gcg tgg acg gat ttt cgt ttt ccg ggc cgt gga aac acg tac 786
Ile Lys Ala Trp Thr Asp Phe Arg Phe Pro Gly Arg Gly Asn Thr Tyr
135 140 145

agt gat ttt aaa tgg cat tgg tat cat ttc gac gga gcg gac tgg gat 834
Ser Asp Phe Lys Trp His Trp Tyr His Phe Asp Gly Ala Asp Trp Asp
150 155 160

gaa tcc cgg aag atc agc cgc atc ttt aag ttt cgt ggg gaa gga aaa 882
Glu Ser Arg Lys Ile Ser Arg Ile Phe Lys Phe Arg Gly Glu Gly Lys
165 170 175 180

gcg tgg gat tgg gaa gta tca agt gaa aac ggc aac tat gac tat tta 930
Ala Trp Asp Trp Glu Val Ser Ser Glu Asn Gly Asn Tyr Asp Tyr Leu
185 190 195

atg tat gct gat gtt gac tac gac cac cct gat gtc gtg gca gag aca 978
Met Tyr Ala Asp Val Asp Tyr Asp His Pro Asp Val Val Ala Glu Thr
200 205 210

aaa aaa tgg ggt atc tgg tat gcg aat gaa ctg tca tta gac ggc ttc 1026
Lys Lys Trp Gly Ile Trp Tyr Ala Asn Glu Leu Ser Leu Asp Gly Phe
215 220 225

cgt att gat gcc gcc aaa cat att aaa ttt tca ttt ctg cgt gat tgg 1074
Arg Ile Asp Ala Ala Lys His Ile Lys Phe Ser Phe Leu Arg Asp Trp
230 235 240

gtt cag gcg gtc aga cag gcg acg gga aaa gaa atg ttt acg gtt gcg 1122
Val Gln Ala Val Arg Gln Ala Thr Gly Lys Glu Met Phe Thr Val Ala
245 250 255 260

gag tat tgg cag aat aat gcc ggg aaa ctc gaa aac tac ttg aat aaa 1170
Glu Tyr Trp Gln Asn Asn Ala Gly Lys Leu Glu Asn Tyr Leu Asn Lys
265 270 275

aca agc ttt aat caa tcc gtg ttt gat gtt ccg ctt cat ttc aat tta 1218
Thr Ser Phe Asn Gln Ser Val Phe Asp Val Pro Leu His Phe Asn Leu
280 285 290

cag gcg gct tcc tca caa gga ggc gga tat gat atg agg cgt ttg ctg 1266
Gln Ala Ala Ser Ser Gln Gly Gly Gly Tyr Asp Met Arg Arg Leu Leu
295 300 305

gac ggt acc gtt gtg tcc agg cat ccg gaa aag gcg gtt aca ttt gtt 1314
Asp Gly Thr Val Val Ser Arg His Pro Glu Lys Ala Val Thr Phe Val
310 315 320

gaa aat cat gac aca cag ccg gga cag tca ttg gaa tcg aca gtc caa 1362
Glu Asn His Asp Thr Gln Pro Gly Gln Ser Leu Glu Ser Thr Val Gln
325 330 335 340

act tgg ttt aaa ccg ctt gca tac gcc ttt att ttg aca aga gaa tcc 1410
Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe Ile Leu Thr Arg Glu Ser
345 350 355

ggt tat cct cag gtg ttc tat ggg gat atg tac ggg aca aaa ggg aca 1458
Gly Tyr Pro Gln Val Phe Tyr Gly Asp Met Tyr Gly Thr Lys Gly Thr
360 365 370

tcg cca aag gaa att ccc tca ctg aaa gat aat ata gag ccg att tta 1506
Ser Pro Lys Glu Ile Pro Ser Leu Lys Asp Asn Ile Glu Pro Ile Leu
375 380 385

aaa gcg cgt aag gag tac gca tac ggg ccc cag cac gat tat att gac 1554
Lys Ala Arg Lys Glu Tyr Ala Tyr Gly Pro Gln His Asp Tyr Ile Asp
390 395 400

cac ccg gat gtg atc gga tgg acg agg gaa ggt gac agc tcc gcc gcc 1602
His Pro Asp Val Ile Gly Trp Thr Arg Glu Gly Asp Ser Ser Ala Ala
405 410 415 420

aaa tca ggt ttg gcc gct tta atc acg gac gga ccc ggc gga tca aag 1650
Lys Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly Pro Gly Gly Ser Lys
425 430 435

cgg atg tat gcc ggc ctg aaa aat gcc ggc gag aca tgg tat gac ata 1698
Arg Met Tyr Ala Gly Leu Lys Asn Ala Gly Glu Thr Trp Tyr Asp Ile
440 445 450

acg ggc aac cgt tca gat act gta aaa atc gga tct gac ggc tgg gga 1746
Thr Gly Asn Arg Ser Asp Thr Val Lys Ile Gly Ser Asp Gly Trp Gly
455 460 465

gag ttt cat gta aac gat ggg tcc gtc tcc att tat gtt cag aaa taa 1794
Glu Phe His Val Asn Asp Gly Ser Val Ser Ile Tyr Val Gln Lys
470 475 480

ggtaataaaa aaacacctcc aagctgagtg cgggtatcag cttggaggtg cgtttatttt 1854

ttcagccgta tgacaaggtc ggcatcaggt gtgacaaata cggtatgctg gctgtcatag 1914

gtgacaaatc cgggttttgc gccgtttggc tttttcacat gtctgatttt tgtataatca 1974

acaggcacgg agccggaatc tttcgccttg gaaaaataag cggcgatcgt agctgcttcc 2034

aatatggatt gttcatcggg atcgctgctt ttaatcacaa cgtgggatcc 2084


<210> 10
<211> 483
<212> PRT
<213> Bacillus amyloliquefaciens

<400> 10

Val Asn Gly Thr Leu Met Gln Tyr Phe Glu Trp Tyr Thr Pro Asn Asp
1 5 10 15

Gly Gln His Trp Lys Arg Leu Gln Asn Asp Ala Glu His Leu Ser Asp
20 25 30

Ile Gly Ile Thr Ala Val Trp Ile Pro Pro Ala Tyr Lys Gly Leu Ser
35 40 45

Gln Ser Asp Asn Gly Tyr Gly Pro Tyr Asp Leu Tyr Asp Leu Gly Glu
50 55 60

Phe Gln Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys Ser Glu
65 70 75 80

Leu Gln Asp Ala Ile Gly Ser Leu His Ser Arg Asn Val Gln Val Tyr
85 90 95

Gly Asp Val Val Leu Asn His Lys Ala Gly Ala Asp Ala Thr Glu Asp
100 105 110

Val Thr Ala Val Glu Val Asn Pro Ala Asn Arg Asn Gln Glu Thr Ser
115 120 125



Glu Glu Tyr Gln Ile Lys Ala Trp Thr Asp Phe Arg Phe Pro Gly Arg
130 135 140

Gly Asn Thr Tyr Ser Asp Phe Lys Trp His Trp Tyr His Phe Asp Gly
145 150 155 160

Ala Asp Trp Asp Glu Ser Arg Lys Ile Ser Arg Ile Phe Lys Phe Arg
165 170 175

Gly Glu Gly Lys Ala Trp Asp Trp Glu Val Ser Ser Glu Asn Gly Asn
180 185 190

Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Tyr Asp His Pro Asp Val
195 200 205

Val Ala Glu Thr Lys Lys Trp Gly Ile Trp Tyr Ala Asn Glu Leu Ser
210 215 220

Leu Asp Gly Phe Arg Ile Asp Ala Ala Lys His Ile Lys Phe Ser Phe
225 230 235 240

Leu Arg Asp Trp Val Gln Ala Val Arg Gln Ala Thr Gly Lys Glu Met
245 250 255

Phe Thr Val Ala Glu Tyr Trp Gln Asn Asn Ala Gly Lys Leu Glu Asn
260 265 270

Tyr Leu Asn Lys Thr Ser Phe Asn Gln Ser Val Phe Asp Val Pro Leu
275 280 285

His Phe Asn Leu Gln Ala Ala Ser Ser Gln Gly Gly Gly Tyr Asp Met
290 295 300

Arg Arg Leu Leu Asp Gly Thr Val Val Ser Arg His Pro Glu Lys Ala
305 310 315 320

Val Thr Phe Val Glu Asn His Asp Thr Gln Pro Gly Gln Ser Leu Glu
325 330 335

Ser Thr Val Gln Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe Ile Leu
340 345 350

Thr Arg Glu Ser Gly Tyr Pro Gln Val Phe Tyr Gly Asp Met Tyr Gly
355 360 365

Thr Lys Gly Thr Ser Pro Lys Glu Ile Pro Ser Leu Lys Asp Asn Ile
370 375 380

Glu Pro Ile Leu Lys Ala Arg Lys Glu Tyr Ala Tyr Gly Pro Gln His
385 390 395 400

Asp Tyr Ile Asp His Pro Asp Val Ile Gly Trp Thr Arg Glu Gly Asp
405 410 415

Ser Ser Ala Ala Lys Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly Pro
420 425 430

Gly Gly Ser Lys Arg Met Tyr Ala Gly Leu Lys Asn Ala Gly Glu Thr
435 440 445

Trp Tyr Asp Ile Thr Gly Asn Arg Ser Asp Thr Val Lys Ile Gly Ser
450 455 460

Asp Gly Trp Gly Glu Phe His Val Asn Asp Gly Ser Val Ser Ile Tyr
465 470 475 480

Val Gln Lys
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<210> 11 
<211> 1458 
<212> DNA

<213> Bacillus sp.

<220>
<221> CDS
<222> (1)..(1458)
<223> AA560 

<400> 11 

cac cat aat ggt acg aac ggc aca atg atg cag tac ttt gaa tgg tat 48
His His Asn Gly Thr Asn Gly Thr Met Met Gln Tyr Phe Glu Trp Tyr
1 5 10 15

cta cca aat gac gga aac cat tgg aat aga tta agg tct gat gca agt 96
Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Ser Asp Ala Ser
20 25 30

aac cta aaa gat aaa ggg atc tca gcg gtt tgg att cct cct gca tgg 144
Asn Leu Lys Asp Lys Gly Ile Ser Ala Val Trp Ile Pro Pro Ala Trp
35 40 45

aag ggt gcc tct caa aat gat gtg ggg tat ggt gct tat gat ctg tat 192
Lys Gly Ala Ser Gln Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr
50 55 60

gat tta gga gaa ttc aat caa aaa gga acc att cgt aca aaa tat gga 240
Asp Leu Gly Glu Phe Asn Gln Lys Gly Thr Ile Arg Thr Lys Tyr Gly
65 70 75 80

acg cgc aat cag tta caa gct gca gtt aac gcc ttg aaa agt aat gga 288
Thr Arg Asn Gln Leu Gln Ala Ala Val Asn Ala Leu Lys Ser Asn Gly
85 90 95

att caa gtg tat ggc gat gtt gta atg aat cat aaa ggg gga gca gac 336
Ile Gln Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp
100 105 110

gct acc gaa atg gtt agg gca gtt gaa gta aac ccg aat aat aga aat 384
Ala Thr Glu Met Val Arg Ala Val Glu Val Asn Pro Asn Asn Arg Asn
115 120 125

caa gaa gtg tcc ggt gaa tat aca att gag gct tgg aca aag ttt gac 432
Gln Glu Val Ser Gly Glu Tyr Thr Ile Glu Ala Trp Thr Lys Phe Asp
130 135 140

ttt cca gga cga ggt aat act cat tca aac ttc aaa tgg aga tgg tat 480
Phe Pro Gly Arg Gly Asn Thr His Ser Asn Phe Lys Trp Arg Trp Tyr
145 150 155 160

cac ttt gat gga gta gat tgg gat cag tca cgt aag ctg aac aat cga 528
His Phe Asp Gly Val Asp Trp Asp Gln Ser Arg Lys Leu Asn Asn Arg
165 170 175

att tat aaa ttt aga ggt gat gga aaa ggg tgg gat tgg gaa gtc gat 576
Ile Tyr Lys Phe Arg Gly Asp Gly Lys Gly Trp Asp Trp Glu Val Asp
180 185 190

aca gaa aac ggt aac tat gat tac cta atg tat gca gat att gac atg 624
Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Ile Asp Met
195 200 205

gat cac cca gag gta gtg aat gag cta aga aat tgg ggt gtt tgg tat 672
Asp His Pro Glu Val Val Asn Glu Leu Arg Asn Trp Gly Val Trp Tyr
210 215 220

acg aat aca tta ggc ctt gat ggt ttt aga ata gat gca gta aaa cat 720
Thr Asn Thr Leu Gly Leu Asp Gly Phe Arg Ile Asp Ala Val Lys His
225 230 235 240

ata aaa tac agc ttt act cgt gat tgg att aat cat gtt aga agt gca 768
Ile Lys Tyr Ser Phe Thr Arg Asp Trp Ile Asn His Val Arg Ser Ala
245 250 255

act ggc aaa aat atg ttt gcg gtt gcg gaa ttt tgg aaa aat gat tta 816
Thr Gly Lys Asn Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu
260 265 270

ggt gct att gaa aac tat tta aac aaa aca aac tgg aac cat tca gtc 864
Gly Ala Ile Glu Asn Tyr Leu Asn Lys Thr Asn Trp Asn His Ser Val
275 280 285

ttt gat gtt ccg ctg cac tat aac ctc tat aat gct tca aaa agc gga 912
Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Lys Ser Gly
290 295 300

ggg aat tat gat atg agg caa ata ttt aat ggt aca gtc gtg caa aga 960
Gly Asn Tyr Asp Met Arg Gln Ile Phe Asn Gly Thr Val Val Gln Arg
305 310 315 320



cat cca atg cat gct gtt aca ttt gtt gat aat cat gat tcg caa cct 1008
His Pro Met His Ala Val Thr Phe Val Asp Asn His Asp Ser Gln Pro
325 330 335

gaa gaa gct tta gag tct ttt gtt gaa gaa tgg ttc aaa cca tta gcg 1056
Glu Glu Ala Leu Glu Ser Phe Val Glu Glu Trp Phe Lys Pro Leu Ala
340 345 350

tat gct ttg aca tta aca cgt gaa caa ggc tac cct tct gta ttt tat 1104
Tyr Ala Leu Thr Leu Thr Arg Glu Gln Gly Tyr Pro Ser Val Phe Tyr
355 360 365

gga gat tat tat ggc att cca acg cat ggt gta cca gcg atg aaa tcg 1152
Gly Asp Tyr Tyr Gly Ile Pro Thr His Gly Val Pro Ala Met Lys Ser
370 375 380

aaa att gac ccg att cta gaa gcg cgt caa aag tat gca tat gga aga 1200
Lys Ile Asp Pro Ile Leu Glu Ala Arg Gln Lys Tyr Ala Tyr Gly Arg
385 390 395 400

caa aat gac tac tta gac cat cat aat atc atc ggt tgg aca cgt gaa 1248
Gln Asn Asp Tyr Leu Asp His His Asn Ile Ile Gly Trp Thr Arg Glu
405 410 415

ggg aat aca gca cac ccc aac tcc ggt tta gct act atc atg tcc gat 1296
Gly Asn Thr Ala His Pro Asn Ser Gly Leu Ala Thr Ile Met Ser Asp
420 425 430

ggg gca gga gga aat aag tgg atg ttt gtt ggg cgt aat aaa gct ggt 1344
Gly Ala Gly Gly Asn Lys Trp Met Phe Val Gly Arg Asn Lys Ala Gly
435 440 445

caa gtt tgg acc gat atc act gga aat cgt gca ggt act gtt acg att 1392
Gln Val Trp Thr Asp Ile Thr Gly Asn Arg Ala Gly Thr Val Thr Ile
450 455 460

aat gct gat gga tgg ggt aat ttt tct gta aat gga gga tca gtt tct 1440
Asn Ala Asp Gly Trp Gly Asn Phe Ser Val Asn Gly Gly Ser Val Ser
465 470 475 480

att tgg gta aac aaa taa 1458
Ile Trp Val Asn Lys
485
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<210> 12 

<211> 485 

<212> PRT 

<213> Bacillus sp.

<400> 12

His His Asn Gly Thr Asn Gly Thr Met Met Gln Tyr Phe Glu Trp Tyr
1 5 10 15

Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Ser Asp Ala Ser
20 25 30

Asn Leu Lys Asp Lys Gly Ile Ser Ala Val Trp Ile Pro Pro Ala Trp
35 40 45

Lys Gly Ala Ser Gln Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr
50 55 60

Asp Leu Gly Glu Phe Asn Gln Lys Gly Thr Ile Arg Thr Lys Tyr Gly
65 70 75 80

Thr Arg Asn Gln Leu Gln Ala Ala Val Asn Ala Leu Lys Ser Asn Gly
85 90 95

Ile Gln Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp
100 105 110

Ala Thr Glu Met Val Arg Ala Val Glu Val Asn Pro Asn Asn Arg Asn
115 120 125

Gln Glu Val Ser Gly Glu Tyr Thr Ile Glu Ala Trp Thr Lys Phe Asp
130 135 140

Phe Pro Gly Arg Gly Asn Thr His Ser Asn Phe Lys Trp Arg Trp Tyr
145 150 155 160

His Phe Asp Gly Val Asp Trp Asp Gln Ser Arg Lys Leu Asn Asn Arg
165 170 175

Ile Tyr Lys Phe Arg Gly Asp Gly Lys Gly Trp Asp Trp Glu Val Asp
180 185 190

Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Ile Asp Met
195 200 205

Asp His Pro Glu Val Val Asn Glu Leu Arg Asn Trp Gly Val Trp Tyr
210 215 220

Thr Asn Thr Leu Gly Leu Asp Gly Phe Arg Ile Asp Ala Val Lys His
225 230 235 240

Ile Lys Tyr Ser Phe Thr Arg Asp Trp Ile Asn His Val Arg Ser Ala
245 250 255

Thr Gly Lys Asn Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu
260 265 270

Gly Ala Ile Glu Asn Tyr Leu Asn Lys Thr Asn Trp Asn His Ser Val
275 280 285

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Lys Ser Gly
290 295 300

Gly Asn Tyr Asp Met Arg Gln Ile Phe Asn Gly Thr Val Val Gln Arg
305 310 315 320

His Pro Met His Ala Val Thr Phe Val Asp Asn His Asp Ser Gln Pro
325 330 335

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Glu Glu Ala Leu Glu Ser Phe Val Glu Glu Trp Phe Lys Pro Leu Ala 
340 345 350 

Tyr Ala Leu Thr Leu Thr Arg Glu Gln Gly Tyr Pro Ser Val Phe Tyr
355 360 365

Gly Asp Tyr Tyr Gly Ile Pro Thr His Gly Val Pro Ala Met Lys Ser
370 375 380

Lys Ile Asp Pro Ile Leu Glu Ala Arg Gln Lys Tyr Ala Tyr Gly Arg
385 390 395 400

Gln Asn Asp Tyr Leu Asp His His Asn Ile Ile Gly Trp Thr Arg Glu
405 410 415

Gly Asn Thr Ala His Pro Asn Ser Gly Leu Ala Thr Ile Met Ser Asp
420 425 430

Gly Ala Gly Gly Asn Lys Trp Met Phe Val Gly Arg Asn Lys Ala Gly
435 440 445

Gln Val Trp Thr Asp Ile Thr Gly Asn Arg Ala Gly Thr Val Thr Ile
450 455 460

Asn Ala Asp Gly Trp Gly Asn Phe Ser Val Asn Gly Gly Ser Val Ser
465 470 475 480

Ile Trp Val Asn Lys
485



<210> 13 
<211> 485 
<212> PRT

<213> Bacillus sp. 707

<400> 13

His His Asn Gly Thr Asn Gly Thr Met Met Gln Tyr Phe Glu Trp Tyr
1 5 10 15

Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Asn Ser Asp Ala Ser
20 25 30

Asn Leu Lys Ser Lys Gly Ile Thr Ala Val Trp Ile Pro Pro Ala Trp
35 40 45

Lys Gly Ala Ser Gln Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr
50 55 60

Asp Leu Gly Glu Phe Asn Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly
65 70 75 80

Thr Arg Ser Gln Leu Gln Ala Ala Val Thr Ser Leu Lys Asn Asn Gly
85 90 95

Ile Gln Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp
100 105 110

Ala Thr Glu Met Val Arg Ala Val Glu Val Asn Pro Asn Asn Arg Asn
115 120 125

Gln Glu Val Thr Gly Glu Tyr Thr Ile Glu Ala Trp Thr Arg Phe Asp
130 135 140

Phe Pro Gly Arg Gly Asn Thr His Ser Ser Phe Lys Trp Arg Trp Tyr
145 150 155 160

His Phe Asp Gly Val Asp Trp Asp Gln Ser Arg Arg Leu Asn Asn Arg
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165 170 175 

Ile Tyr Lys Phe Arg Gly His Gly Lys Ala Trp Asp Trp Glu Val Asp
180 185 190

Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Ile Asp Met
195 200 205

Asp His Pro Glu Val Val Asn Glu Leu Arg Asn Trp Gly Val Trp Tyr
210 215 220

Thr Asn Thr Leu Gly Leu Asp Gly Phe Arg Ile Asp Ala Val Lys His
225 230 235 240

Ile Lys Tyr Ser Phe Thr Arg Asp Trp Ile Asn His Val Arg Ser Ala
245 250 255

Thr Gly Lys Asn Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu
260 265 270

Gly Ala Ile Glu Asn Tyr Leu Gln Lys Thr Asn Trp Asn His Ser Val
275 280 285

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Lys Ser Gly
290 295 300

Gly Asn Tyr Asp Met Arg Asn Ile Phe Asn Gly Thr Val Val Gln Arg
305 310 315 320

His Pro Ser His Ala Val Thr Phe Val Asp Asn His Asp Ser Gln Pro
325 330 335

Glu Glu Ala Leu Glu Ser Phe Val Glu Glu Trp Phe Lys Pro Leu Ala
340 345 350

Tyr Ala Leu Thr Leu Thr Arg Glu Gln Gly Tyr Pro Ser Val Phe Tyr
355 360 365

Gly Asp Tyr Tyr Gly Ile Pro Thr His Gly Val Pro Ala Met Arg Ser
370 375 380

Lys Ile Asp Pro Ile Leu Glu Ala Arg Gln Lys Tyr Ala Tyr Gly Lys
385 390 395 400

Gln Asn Asp Tyr Leu Asp His His Asn Ile Ile Gly Trp Thr Arg Glu
405 410 415

Gly Asn Thr Ala His Pro Asn Ser Gly Leu Ala Thr Ile Met Ser Asp
420 425 430

Gly Ala Gly Gly Ser Lys Trp Met Phe Val Gly Arg Asn Lys Ala Gly
435 440 445

Gln Val Trp Ser Asp Ile Thr Gly Asn Arg Thr Gly Thr Val Thr Ile
450 455 460

Asn Ala Asp Gly Trp Gly Asn Phe Ser Val Asn Gly Gly Ser Val Ser
465 470 475 480

Ile Trp Val Asn Lys
485

<210> 14 
<211> 60 
<212> DNA 
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88 

<213> Artificial/Unknown 
<220> 

<221> misc_feature 
<222> ()..() 
<223> Primer FSA 



<400> 14 

caaaatcgta tctacaaatt cmrkrsyarg dvktgggatt sggaagtaga ttcggaaaat 60



<210> 15 

<211> 21

<212> DNA

<213> Artificial/Unknown

<220>

<221> misc_feature

<222> ()..()

<223> Primer RSA

<400> 15

gaatttgtag atacgatttt g 21

<210> 16 

<211> 24 

<212> DNA 

<213> Artificial/Unknown

<220>

<221> misc_feature

<222> ()..()

<223> Primer B1




10004 .010-DK 



<400> 16

cgattgctga cgctgttatt tgcg 24

<210> 17 

<211> 24 

<212> DNA 

<213> Artificial/Unknown 



<220>

<221> misc_feature

<222> ()..()

<223> Primer Y2

<400> 17

cttgttccct tgtcagaacc aatg 24



<210> 18 

<211> 30 

<212> DNA

<213> Artificial/Unknown

<220>

<221> misc_feature

<222> ()..()

<223> Primer 101458
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230 


12.478 48.268 78.919 1.00 25.61 6
ATOM 1872 CG LEU A 230 13.741 48.428 79.759 1.00 18.62 6
ATOM 1873
LEU A 230 14.014 47.133 80.513 1.00 27.61 6
ATOM 1874
230 14.967 48.696 78.877 1.00 28.13 6
ASP
231 11.278 48.365 75.924 1.00 21.79 7
231 10.246 47.835 75.043 1.00 20.65
ASP
231 10.110 46.339 75.292 1.00 20.30 6
ATOM 1878 O ASP A 231 9.103 45.756 74.891 1.00 20.54 8
ATOM 1879 CB ASP A 231 10.618 48.043 73.528 1.00 21.94 6
ATOM 1880 CG ASP A 231 10.901 49.525 73.367 1.00 13.94 6
ATOM 1881 OD1 ASP A 231 9.942 50.286 73.503 1.00 24.69 8
ATOM 1882 OD2 ASP A 231 12.054 49.949 73.183 1.00 20.63 8
ATOM 1883 N GLY A 232 11.122 45.727 75.946 1.00 18.68 7
ATOM 1884 CA GLY A 232 10.932 44.300 76.199 1.00 16.67 6
ATOM 1885 C GLY A 232 12.089 43.727 76.991 1.00 15.71 6
ATOM 1886 O GLY A 232 12.953 44.439 77.478 1.00 16.01 8
ATOM 1887 PHE A 233 12
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A. 
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.-55 
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CA 


PHE A 233 13.180 41.803 77.919 1.00 13.86 6
ATOM 1889 C PHE A 233 13.750 40.550 77.278 1.00 12.54 6
ATOM 1890 O PHE A 233 13.006 39.820 76.615 1.00 12.13 8
ATOM 1891 CB PHE A 233 12.440 41.134 79.160 1.00 12.94 6
ATOM 1892 CG PHE A 233 11.624 42.107 79.987 1.00 4.98 6
ATOM 1893 CD1 PHE A 233 12.212 43.177 80.588 1.00 6.42 6
ATOM 1894 CD2 PHE A 233 10.279 41.877 80.166 1.00 15.69 6
ATOM 1895 CE1 PHE A 233 11.475 44.065 81.375 1.00 15.76 6
ATOM 1896 CE2 PHE A 233 9.528 42.742 80.945 1.00 25.06 6
ATOM 1897 CZ PHE A 233 10.115 43.834 81.532 1.00 14.14
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C 
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A 
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15 


.587 38.314 78.928 1.00 14.55 6
ATOM 1901 O ARG A 234 16.230 38.935 79.814 1.00 14.89 8
ATOM 1902 CB ARG A 234 16.899 39.022 76.882 1.00 5.93 6
ATOM 1903 CG ARG A 234 17.586 37.656 76.732 1.00 10.20 6
ATOM 1904 CD ARG A 234 18.540 37.408 77.899 1.00 15.00 6
ATOM 1905 NE ARG A 234 19.404 36.232 77.692 1.00 14.34 7
ATOM 1906 CZ ARG A 234 20.182 35.688 78.615 1.00 24.46 6
ATOM 1907 NH1 ARG A 234 20.160 36.173 79.871 1.00 13.27 7
ATOM 1908 NH2 ARG A 234 20.930 34.647 78.289 1.00 12.67 7
ATOM 1909 N ILE A 235 14.928 37.217 79.185 1.00 14.15 7
ATOM 1910 CA ILE A 235 14.874 36.653 80.533 1.00 12.69 6
ATOM 1911 C ILE A 235 15.873 35.548 80.793 1.00 12.06 6
ATOM 1912 O ILE A 235 15.773 34.464 80.203 1.00 12.54 8
ATOM 1913 CB ILE A 235 13.446 36.187 80.820 1.00 10.54 6
ATOM 1914 CG1 ILE A 235 12.504 37.385 80.567 1.00 9.10 6
ATOM 1915 CG2 ILE A 235 13.310 35.831 82.331 1.00 16.58 6
ATOM 1916 CD1 ILE A 235 11.115 36.763 80.407 1.00 15.58 6
ATOM 1917 N ASP A 236 16.776 35.785 81.749 1.00 9.36 7
ATOM 1918 CA ASP A 236 17.781 34.796 82.093 1.00 10.49 6
ATOM 1919 C ASP A 236 17.441 33.558 82.860 1.00 12.64 6
ATOM 1920 O ASP A 236 16.607 33.534 83.759 1.00 15.01 8
ATOM 1921 CB ASP A 236 18.886 35.533 82.806 1.00 10.07 6
ATOM 1922 CG ASP A 236 20.154 34.798 83.084 1.00 20.01 6
ATOM 1923 OD1 ASP A 236 20.882 34.542 82.112 1.00 14.97 8
ATOM 1924 OD2 ASP A 236 20.455 34.555 84.310 1.00 22.16 8
ATOM 1925 N ALA A 237 18.050 32.429 82.484 1.00 11.95 7
ATOM 1926 CA ALA A 237 18.011 31.158 83.169 1.00 12.87
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ATOM 


1927 C ALA A 237 16.653 30.687 83.598 1.00 16.11 6
ATOM 1928 O ALA A 237 16.440 30.225 84.762 1.00 15.77 8
ATOM 1929 CB ALA A 237 18.924 31.424 84.407 1.00 13.46 6
ATOM 1930 N VAL A 238 15.677 30.602 82.687 1.00 9.31 7
ATOM 1931 CA VAL A 238 14.320 30.328 83.035 1.00 7.91 6
ATOM 1932 C VAL A 238 14.133 28.877 83.391 1.00 9.01 6
ATOM 1933 O VAL A 238 13.113 28.602 84.037 1.00 10.54 8
ATOM 1934 CB VAL A 238 13.212 30.742 82.045 1.00 14.17 6
ATOM 1935 CG1 VAL A 238 13.273 32.240 81.788 1.00 13.29 6
ATOM 1936 CG2 VAL A 238 13.486 29.997 80.712 1.00 15.01 6
ATOM 1937 N LYS A 239 15.066 27.991 83.077 1.00 5.59 7
ATOM 1938 CA LYS A 239 14.711 26.629 83.448 1.00 10.08 6
ATOM 1939 C LYS A 239 14.865 26.363 85.008 1.00 7.29 6
ATOM 1940 O LYS A 239 14.529 25.276 85.419 1.00 7.91 8
ATOM 1941 CB LYS A 239 15.636 25.685 82.762 1.00 12.33 6
ATOM 1942 CG LYS A 239 17.092 25.584 82.801 1.00 25.74 6
ATOM 1943 CD LYS A 239 17.278 24.453 81.732 1.00 25.18 6
ATOM 1944 CE LYS A 239 18.708 24.141 81.578 1.00 26.14 6
ATOM 1945 NZ LYS A 239 19.049 22.792 81.090 1.00 29.06 7
ATOM 1946 N HIS A 240 15.615 27.232 85.621 1.00 8.92 7
ATOM 1947 CA HIS A 240 15.904 27.175 87.047 1.00 14.13 6
ATOM 1948 C HIS A 240 14.rSSO 28.056 87.743 1.00 rs.09 6
ATOM 1949 O HIS A 240 15.029 28.277 88.936 1.00 10.80 8
ATOM 1950 CB HIS A 240 17.327 27.746 87.264 1.00 10.84 6
ATOM 1951 CG HIS A 240 18.312 27.095 86.328 1.00 7.11 6
ATOM 1952 ND1 HIS A 240 18.625 25.766 86.369 1.00 6.30 7
ATOM 1953 CD2 HIS A 240 18.997 27.608 85.226 1.00 9.29 6
ATOM 1954 CE1 HIS A 240 19.524 25.452 85.416 1.00 10.25 6
ATOM 1955 NE2 HIS A 240 19.767 26.568 84.761 1.00 6.19 7
ATOM 1956 N ILE A 241 13.855 28.688 87.174 1.00 11.07 7
ATOM 1957 CA ILE A 241 12.949 29.591 87.887 1.00 9.64 6
ATOM 1958 C ILE A 241 11.555 29.021 87.800 1.00 14.21 6
ATOM 1959 O ILE A 241 11.114 28.540 86.713 1.00 9.89 8
ATOM 1960 CB ILE A 241 13.014 30.983 87.274 1.00 10.12 6
ATOM 1961 CG1 ILE A 241 14.389 31.615 87.488 1.00 7.20 6
ATOM 1962 CG2 ILE A 241 11.921 31.942 87.733 1.00 13.70 6
ATOM 1963 CD1 ILE A 241 14.592 32.912 86.714 1.00 8.67 6
ATOM 1964 N LYS A 242 10.790 29.009 88.919 1.00 11.24 7
ATOM 1965 CA LYS A 242 9.482 28.353 88.886 1.00 11.43 6
ATOM 1966 C LYS A 242 8.667 28.791 87.642 1.00 11.92 6
ATOM 1967 O LYS A 242 8.506 29.992 87.454 1.00 12.59 8
ATOM 1968 CB LYS A 242 8.805 28.706 90.226 1.00 9.80 6
ATOM 1969 CG LYS A 242 7.375 28.212 90.391 1.00 14.60 6
ATOM 1970 CD LYS A 242 6.822 28.881 91.688 1.00 23.88 6
ATOM 1971 CE LYS A 242 5.656 28.032 92.185 1.00 30.29 6
ATOM 1972 NZ LYS A 242 5.228 28.446 93.546 1.00 51.79 7
ATOM 1973 N TYR A 243 8.207 27.886 86.794 1.00 12.14 7
ATOM 1974 CA TYR A 243 7.591 28.335 85.513 1.00 12.18 6
ATOM 1975 C TYR A 243 6.430 29.283 85.699 1.00 13.50 6
ATOM 1976 O TYR A 243 6.349 30.392 85.138 1.00 10.33 8
ATOM 1977 CB TYR A 243 7.177 27.155 84.639 1.00 11.39 6
ATOM 1978 CG TYR A 243 8.272 26.205 84.196 1.00 8.33 6
ATOM 1979 CD1 TYR A 243 9.605 26.506 84.310 1.00 9.57 6
ATOM 1980 CD2 TYR A 243 7.937 24.971 83.661 1.00 6.42 6
ATOM 1981 CE1 TYR A 243 10.610 25.624 83.980 1.00 7.83 6
ATOM 1982 CE2 TYR A 243 8.937 24.099 83.234 1.00 6.98 6
ATOM 1983 CZ TYR A 243 10.260 24.446 83.402 1.00 7.87 6
ATOM 1984 OH TYR A 243 11.213 23.503 83.006 1.00 8.46 8
ATOM 1985 N SER A 244 5.514 28.975 86.659 1.00 12.18 7
ATOM 1986 CA SER A 244 4.381 29.873 86.861 1.00 11.90 6
ATOM 1987 C SER A 244 4.708 31.236 87.348 1.00 12.87
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ATOM 


1988
SER A 244
ATOM 1989 CB SER A 244
OG
A 244
ATOM 1991
PHE A 245
ATOM 1992
PHE A
1993 C PHE A
ATOM 1994 O PHE A
ATOM 1995 CB PHE A 245
ATOM 1996 CG PHE A 245
ATOM 1997 CD1 PHE A 245
ATOM 1998 CD2 PHE A 245
ATOM 1999 CE1 PHE A 245
ATOM 2000
2001
2002
2003
2004 C THR
2005 O THR
2006 CB THR A 246
ATOM 2007
2011
2012
ARG
2013 CB ARG A
ATOM 2014 CG ARG A
ATOM 2015 CD ARG A
ATOM 2016 NE ARG A 247
ATOM 2017 CZ ARG A 247
ATOM 2018 NH1 ARG A 247
ATOM 2019 NH2 ARG A 247
ATOM 2020 N ASP A 248
ATOM 2021 CA ASP A 248
ATOM 2022 C ASP A 248
ATOM 2023 O ASP A 248
ATOM 2024 CB ASP A 248
ATOM 2025 CG ASP A 248
ATOM 2026 OD1 ASP A 248
ATOM 2027 OD2 ASP A 248
ATOM 2028 N TRP A 249
ATOM 2029 CA TRP A 249
ATOM 2030 C TRP A 249
ATOM
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